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Abstract 
This thesis comprises three self-contained chapters. Its main object of interest is the 
wage returns to education and experience on labor markets, and the earning inequalities 
these returns generate. To understand the origin of returns to education and experience, 
and why they vary across time and space, this thesis adopts a matching perspective: 
it investigates the determinants of relationship formation between workers/employees 
and firms/employers. Employer-employee relations rest on two elements: first, the 
profitability of the relation, i.e. how much a worker and firm can produce together, 
and second, relative supply and demand: how many workers are available to work in 
firms and vice-versa. Because they determine which relationships are formed, these 
two elements eventually impact wages paid by firms to workers. The first chapter 
documents flattening wage returns to experience between higher education graduates 
entering the French labor market in 1998 and 2010. I decompose differences in average 
wage growth by occupation into an extensive (a composition effect across occupations) 
and intensive margin (a variation in wage growth within occupations). I then study two 
mechanisms behind the wage growth slow down: access to managerial positions and 
impact of initial match quality. I find access to managerial positions is more infrequent 
for recent cohorts. I also find that initial match quality has not worsened between the 
1998 and 2010 cohorts, but its impact on future wages has become more enduring. 
The second chapter studies the interplay between worker supply and firm demand, 
and their effect on sorting and wages in the labor market. Specifically, I investigate 
a decrease in the education wage premium on the Portuguese labor market between 
1987 and 2017. I build a model of one-to-many matching with multidimensional types 
in which several workers are employed by a single firm. I structurally estimate the 
model on matched employer-employee data. Counterfactual exercises suggest that both 
changes in worker preferences and the increasing relative productivity of high school 
graduates over non-graduates act as a mitigating force on the decreasing high school 
wage premium, but do not fully compensate for high school graduates’ rise in relative 
supply. In the third chapter, co-authored with Jeremy Fox and Alfred Galichon, we 
explore how expectations on future returns influence matching decisions. We introduce 
a model of dynamic matching with transferable utility. We explore aggregate dynamics 
and show that a stationary equilibrium exists. We propose two algorithms to compute 
a stationary equilibrium, and adapt both for estimation. We then use the methods 
developed to estimate a model of geographic mobility costs for Swedish engineers. We 
find that mobility costs impose a sizeable penalty in match production, and evolve 


non-linearly by age. 


Acknowledgements 


My most grateful thanks go to my advisor Alfred Galichon. I first met Alfred when I 
became his research assistant during my masters in Sciences Po, and was immediately struck 
by his intelligence and enthusiasm. Alfred showed me how powerful a tool convex optimiza- 
tion could be in economics, and has inspired me to pursue a PhD under his supervision. 
Throughout my PhD, he has been very generous with his time and has provided me with 
invaluable guidance and support, both on general and technical subjects. I am very much 
indebted to him. 

I am also very grateful to my advisors Jean-Marc Robin and Moshe Buchinsky. They have 
both taken the time to follow my work and give me valued feedback and advice throughout 
my PhD, from which my research has substantially benefitted. Both their support has been 
especially precious during my job market. 

I would like to thank Zsofia Barany and Kerstin Holzheu for their constant encourage- 
ments and feedbacks. This dissertation owes a lot to their expertise in Labor economics and 
Macroeconomics. I also want to thank Pierre Cahuc for taking the time to read my job 
market paper, which has been tremendously helpful, and Florian Oswald for providing me 
with insightful feedback and encouraging me to hone and improve my computational skills. 

I would also like to take this opportunity to thank Jeremy Fox and Arnaud Dupuy who, 
together with Jean-Marc, Moshe and Zsofia, have agreed to be part of my jury. I am very 
happy to have worked with Jeremy as a co-author: I have learned so much at his contact. I 
very much look forward to working with Arnaud, as I start a postdoc at the University of 
Luxembourg next year. 

My work has also benefitted from the many perceptive comments of the faculty members 
at the Department. In particular, I would like to thank Ghazala Azmat, Nicolas Coeurdacier, 
Jeanne Commault, Michele Fioretti, Guy Laroque, Marleen Mara, Isabelle Méjean, Stefan 
Pollinger, Thierry Mayer and Xavier Ragot. 

I would also like to thank the administrative staff at Sciences Po: Florence Avisse, Cathy 
Benard, Pilar Calvo, Séverine Germain, Claudine Lamaze, Melissa Mundell, and Hadjila 
Nezlioui-Serraz, thank you for you availability and efficiency in dealing administrative issues. 
Special thanks go to Sandrine Le Goff for overseeing traveling and accommodation, always 
finding an available room when we need it, and organizing the Lunch Seminar! Thanks also 
go to Guillaume Sarrat de Tramezaigues, for making the practical side of this PhD a smooth 
experience. 

I have had a wonderful time at Sciences Po throughout my PhD thanks to its many 
smart and friendly PhD students: my own PhD cohort, Edgard, Daniel, Jérome, Pierre, 


Stefan Victor, and Zydney; my job market partners: Alais, Ariane, Clément, Julia, Oliver 


5 


and Vladimir; and everyone who made our open space the combles a great place to work: 
Alexis, Clémence, Daniel, Gustave, Juan, Leonard, Marco, Moritz, Mylene, Naomi, Nourhan, 
Riddhi, Samuel, Ségal, Valentin and Victor. There was always someone in this joyous crowd 
ready for a brainstorming or a drink, and it has greatly contributed to the completion of this 
thesis. I wish everyone all the best. I would also like to thank the now-doctors who defended 
their thesis during my time here, their advice early on in my PhD and their career success 
have been most inspirational: Aseem, Camille, Charles, Elisa, Edoardo, Julien, Ludovic, 
Nicolo, Sophie and Tyler. 

Writing a doctoral thesis can be a nerve-racking experience, and I am grateful for all my 
brilliant friends who reminded me daily that life is not only about work: Howon, Grégory, 
Thomas, Christina, Henrik, Anita, Léa and all the ‘Scubes’ with whom I started my edu- 
cation at Sciences Po more than ten years ago; Mélina, Patrick, Cécile, Rancy, Léa, Max 
and Guillermo who can always be counted on to vent about their own PhDs around a good 
dinner; Laurene, Robin, Kevin and John, who do not despair of improving my theatrical and 
cinematographic culture; Alice, Florestan, Clotilde, Sophie, Laurence, Manon and Manon 
who are always ready for a good laugh when I visit Belgium; and the friends that live further 
away but are not less dear: Flora, Laura, Pierre, Francois, Arnaud and Natalie. 

I would also like to thank my parents, Sandrine and Charles, and my brother, Alexandre, 
for their constant love, support and interest in my work. My mother predicted I would 
be a researcher when I was five, and as always she was right. I am also indebted to my 
grand-parents, Patrice and Gérard, my aunt and uncle, Ariane and Antoine, and my cousins 
Danaé, Olympe and Hector, for welcoming me with open arms when I moved to Paris eleven 
years ago, and caring for me since. I also thank my cousins in La Réunion for continuously 
inviting us for much needed vacations in the sun, and my family-in-law for introducing me 
to the joys of cross-country skiing. I am also grateful for the encouragements and love of my 
grand-mother Monique and grand-father Jean-Claude, who is dearly missed. 

Finally, I would never have completed this thesis without the unbending support and 
faith of my partner Thibaut. He has lead the way into the PhD venture, and has helped 
me progress in so many ways since: by tipping me on algorithms, constantly refactoring my 
code, and proofreading this thesis, to only cite a few. I am extremely lucky to have him in 


my life, and I cannot wait to start the new phase of our life together. 


Declaration 


My work has received the support of the European Research Council (ERC Consolidator 
Grant no. 866274). 


Contents 


List of Figures 13 
List of Tables 14 
Introduction 17 

IRGLErEN GOS! x5 se is bye Steck: Seg Sk ce YY la Shae HY ed Qn Oe Ba oe eh a oe sa 31 


1 The Flattening Returns to Experience for Higher Education Graduates in 


France Al 
1 I HITOGUCHOMs ye bocce Mice le con See & evn Avsa SEs 2 ee Soha oe 42 
2 Data aud. Pinpirital Spacey oo acid ad elke he ee nd ee Sole Me Bia 46 
2 The French labor market between 1998 and 2017 ........2... 46 
22. HS Aa os 20 ee cas eS a ee a Gs ae ee eae ed 48 
2.3 ULL C ON see oh 2 os cy Mee et BE a oe cee ee es a oe eae he oe ahs 50 
3 Presi Si) 2.2 oe BGG k. Gerad og ae Ak Soe ee ae Bae te nog ee noes 53 
Whechianisiiiss dsc oe oe ke OR AS ee Se a ee G at Bebee eA San aoa ed 58 
4.1 Promotion to managerial positions ................... 59 
4.2 Major-Occupation match quality ..................0.. 64 
oO: “RODMSUTESS bests: sh sue sce ote coo Gs ews ae a BOS oe Rien ed Pe Gabe ed 69 
al Sample Tepresentarwvieys mad ae ete ae ee ake ee eke ee es 69 
5.2 Unobserved heterogeneity .........0-..0-0 0002 ee eee 71 
5.3 WiatChIne Os Gy”. ss, so2h a a os etapa ad EOS a Bread ek Gms ee 74 
6 CONGINSION ia... ae Sn ra ae st Be eh ee ee ee ee eee ee She 76 
PSeTenCC si 20st ok kB Oe mhed Oren a gree he A oes ey oe Be Bak tel hang ig tains 78 


2 Education Expansion, Sorting, and the Decreasing Education Wage Pre- 


mium 81 
1 Tnitiroduction «2.0 28 Se ne a ek ee ee 8 82 
2 MOG e x. toca te, dh SoA de 5 A hee AS Bh thee Meee a Metron dn Rial Vad Riate Ne Gel qaait Ne he 87 


21 COUP od Sek oh the Be ath tek nln ie be een ee ee See Se 88 


2, Competitive Hquilibritim 2.3. 20.%¢2% 6-2 ok ee ee ew eS 90 

2idk Links with search and matching models in the literaure ........ 94 

3 Bipirical Evidence > aes 4c pote gs ei See ee OR BE oe eS 95 
3.1 Date: Deschpudn, ¢ 22%. Wea Since eh eke ed 2 oe oto a oe 95 

a2 Pr iCal TACUey SiMe Go Pe We ty els nang ahs i wee me OP eh os Oe 96 

4 Identification and Estiniation «af 4-4 aa ee aS Oe a AS eae PE ES 100 
9) FReSUH S's: 2). SSeS Sle Se ek aS ee ew OE PE CRS 105 
5.1 Parameters Cabumales. tees ty etl rer e.S eae elena oe & So Soe ore 105 

5.2 Wide WE TOMIC HONG at-6-% babe Gr ee er eh Ee ee 109 

5.3 Coumtertattisis: 2272.28 feed 3 he ee AE owe oe 110 

6 COMMISION 2 3260 ta 3.5 Geta toe Te ine thee eae ed ee OPE Soe Hee 113 
ATCT ONC CG. bo ese te See YS che nok Sy okend Gs @rke ek E Goks deh OGL So Oe LS 
KB Waa. ook ae Sree OP eee Bee A he 119 
B (Details-onPrpinical Wats: «ves 6 als iets ew elie ea KG ee te 120 
Bae “Wage levels: by:pender -i S33 ey eon ek Ree ee RS SS es 120 

B.2. The university wage premium ...............-.2+000- 121 

Crs VRTOO Sa, bo ere yy eit te Bais ene te ee ean aides sea sie a We ee Se 122 
Pye SUSUR IO a6 359 odes Sy she ot Bg, De che ahh' ey h E Ges deeh See es Pe 125 
Di ‘Workforce Discretizaiom. 4-0-4 4:5.4.% he oan Aug lade aed bs 125 

ID - Aa AL COriGhiih, «3. o wast-e se bs etsy arc el de ear ald ea ee oe we 126 

Do aikelihGod etadient: 2.6 3 they ok ee er ee Se Re LO 

BM -ounperiactiale 602 3 kn hoe hota Aud! os hae ale ae la ow ed os 128 
F Comparison to Card & Lemieux’s model ..............0.0 2004 130 
3 Repeated Matching Games: an Empirical Framework 135 
1 WO GUGTIOT 5 fateh en eh eS cn, set aad ks ap i ee DN ce as oh Sd Ae Be etd RN: 136 
Z The Model Without Econometric Errors ..............-.-2004- 139 
2:1 Agents and Economywide Variables ................... 139 

2:2 Individual Agent Problems..................2-.2004. 141 

2d Competitive: Equiibrinir 2.0: 4 aoe are Bae ae Me SS bas Pad 142 

2.4 Shalionary BQuiiOniny. < ein ue dee ln ad oe eR DD oD A A 144 

2.9 Finite Horizon for the Economy .................2+4- 145 

3 ‘The: Model With Econometiic: rrr: o.4: 2 ace Gk. ae ee de ek BOS Sk 145 
ool Econometric Preference SHOCK «4406 ee owe be De EA Me BAO 146 

Soe Competitive Equilibrium With Econometric Errors .......... 147 


10 


Be, Stationary Equilibrium With Gumbel Errors .............. 149 


4 Methods for Equilibrium Computation ..................04. 150 
4.1 Equilibrium with Aggregate Dynamics ................. 150 

4.2 Stationary Equilibrium with Constant Aggregate State ........ 153 

5 jBanpirical Application, 8.06 2 4 hoe che EE a OR oe a Sn 158 

6 COMCHISIOD: goos 1482 alent OS ea ee tele een orks an ede Ue tee EP 161 
PREROTONICCS AS Mates, Ne Be A he wl as thee hs Ati ke ed Seah St od Sema Se Es Boa at Ne eS 162 

A, JPTOOIS> fa: Gnd Sa eee ye Se St eS ek aE ee a toe th 165 
Conclusion 173 
Résumé 179 
51H 6) Nf ckigs1| 6 nbs aaa ee a ae a Ree eR ec eR 189 


11 


List of Figures 


bet 
12 
1.3 


1.4 


2.1 
2.2 
2.3 
2.4 
2.9 
2.6 
20 
2.8 
29 
2.10 
26h] 
212 


2.13 
2.14 


Average entry wage over time in euros, by cohort and education level .... 51 
Average wage growth decomposition by education level and occupation ... 57 
Average wage growth decomposition by education level and disaggregated 


occupation - All higher education graduates ..............008. ay 
Average real wage at employment spell’s end, by cohort and education level. 71 
High school graduates versus non graduates relative supply, by age group .. 97 
Estimated high school graduate wage premium by age group ......... 98 
High school - Sector sorting, by age group ..............+.224- 99 
Estimated education productivities ... 2... 2... ee 105 
Estimated age productivities . 2... 106 
Estimated worker preferences .... 2.2.0... 0 eee ee 107 
Estimated elasticities of substitution ...... 2.0.0... 00000. eee 107 


Average wage by education level and age group - Observed versus Predicted 110 


Average wage by education level and age group- Men. ............ 120 
Average wage by education level and age group - Women ........... 121 
University wage premium by age group ................-.004- 121 
Firm distribution by number of high school graduates and non graduates 

eiiplayete 5's at aS aS cee ed hE ee A ea ao t 125 
Worker type distribution and clusters, 1987-1989. ............... 126 
Predicted wage gaps between high school graduates and non-graduates of same 

ACCS toes Ce ae Cole oe oe Gul ge a, Ra at gee ewe aed ae a A a toe Bh a te 133 


13 


List of Tables 


bet 


Le? 
1.3 
1.4 
1.5 
1.6 


1.7 


1.8 
LY 


1.10 


Lelt 


1.12 


1.13 


1.14 
1.15 


1.16 


Ly 
1.18 


Zol 


Education Levels by Occupations within French active population in 1999 and 


QUAL) sige. te Atay tet We eet ws dp Ra ie dk: Bi shee Yes BT ale Gece Gl Mies eee ae 47 
Number of individuals and spells by cohort. .................. 48 
Age, gender, and individual number of spells by cohorts... ......... 49 
Fiducation: level shares by cohort <<. 6 else Sew eds Se P el eee oe es eS 50 
QCecupation Shares by cohort s-4. 38 ae 2 ek he Se ek BR ts ay Ba 50 


Log entry wage regressed on number of years spent on the labor market by 
education level, with gender, location and industry fixed effects. ....... 54 
Log entry wage regressed on number of years spent on the labor market by 
education level and occupation, with gender, location and sector fixed effects 54 
Share of managerial posisitions obtained by cohort, year 1-2 and year 7-8 .. 60 


Log entry wage regressed on dummy for managerial position by education 


level and occupation, with gender, location and sector fixed effects... ... 61 
Chances of obtaining a managerial postition by years spent on the labor mar- 

ket, by education level and occupation ............. 000000 0 62 
Degree specialization shares by cohort... . 2... 0.2... eee ee es 65 


Match quality: median and interquartile range by cohort and education level 66 


Log entry wage regressed on match quality by year and education level - Long 

higher education ‘Craduates! os. 6-6-4aGe Atak: biel ate hele Alok gl Alea A 68 
Average observed wage at end of survey, by cohort and education level ... 70 
Log entry wage regressed on dummy for grade repeat and years spent on the 

laborimarket: by educationslevel .-s-4-2: 4A Son ae Abia Ae a ERE. & 72 
Degree type shares among long higher education graduates .......... 73 
Log entry wage regressed on years spent on the labor market by type of degree 74 
Log entry wage regressed on alternative match quality by year and education 

level - Long higher education graduates... 2... 0... ee ee 76 
Sorting average yearly percentage growth - Observed versus Predicted... . 109 


14 


Dee 
2.3 
2.4 
2.9 
2.6 
2f 
2.8 
29 
2.10 
aL) 


3.1 
3.2 
3.3 
3.4 


Changes in Sorting - Predicted versus Counterfactuals............. i i 
Changes in Wage Premium - Predicted versus Counterfactuals ........ L138 
Changes in Sorting in 16-34 age group - Predicted versus Counterfactuals . . 128 
Changes in Sorting in 35-54 age group - Predicted versus Counterfactuals . . 128 
Changes in Sorting in 55-68 age group - Predicted versus Counterfactuals . . 129 
Changes in Wage Premium in 16-34 age group - Predicted versus Counterfactuals 129 
Changes in Wage Premium in 35-54 age group - Predicted versus Counterfactuals 129 
Changes in Wage Premium in 55-68 age group - Predicted versus Counterfactuals 130 
Estimated age productivities and elasticities of substitution - Reduced Form 132 
Estimated education productivity growth and elasticity of substitution - Re- 

diced Ferity 2 g/g: 2%. an Ek ae OE chee ek ae DST ae RE BIOS SERS 132 


Equilibrium computation: methods’ speed depending on the number of types 157 


Estimation: methods’ speed depending on the number of parameters .... 158 
Estimation: methods’ estimation precision depending on sample size... . . 158 
Estimates for moving cost by age bin .............2.-2022000- 160 


15 


Introduction 


Labor markets are a pervasive aspect of 21°* century Western societies. Every single 
individual in Europe and the United States has at some point in their life an experience on 
labor markets, either by looking for employment or being employed, and scarcely a day goes 
by without media and governments scrutinizing and dissecting labor trends. Of particular 
interest are the unemployment rate and wage levels, which are both viewed as informative 
health measures of the economy. In the academic field, a large branch of economic research 
is also dedicated to labor markets, and questions pertaining to the impact of trade and 
globalization, the workforce’s educational composition, the setting of wages, and the role of 


labor market institutions (among others) provide lively research areas. 


This dissertation aims at understanding labor markets from a microeconomics point of 
view: it models both workers’ and firms’ individual choices on the labor market as a utility 
(for workers) or profits (for firms) maximizing, which at the aggregate level translates into 
workers’ supply of labor and firms’ demand for labor. It then uses the supply and demand 
framework to explain the drivers of wage setting. As such, it falls within the established 
tradition of neoclassical labor economics, in which wages are equilibrium quantities deter- 
mined by supply and demand. However, it departs from the baseline theory as presented by 
Hicks (1932): by accounting for individual idiosyncrasies, introducing heterogeneous workers 
and firms, and recognizing imperfect competition on labor markets, this dissertation reckons 
with a richer wage-setting framework than the standard view that wages are simply equal 


to the marginal product of labor. 


In this dissertation, I choose to focus on an important aspect of labor markets, namely 
wage returns to education and experience. It is generally observed that workers who have 
attained higher education levels, such as high school or university graduates, are paid higher 
wages than workers who have not. Workers who have more years of experience in the labor 
market are typically also paid more than their less-experienced peers. Returns to education 


and experience often combine, in favor of educated and experienced workers. However, the 
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wage gaps between educated and uneducated (or less educated), and experienced and inexpe- 
rienced workers vary across time and space. Understanding why these gaps exist in the first 
place, and why they differ by decade and region is essential for two reasons. First, because 
they are important drivers of inequality: in 2016 wages accounted for 55% of total household 
income in France, and 72% in the United States (Rani and Furrer (2016)). Because wages 
are the main source of total household income in most countries, wage inequality across 
households translates into income inequality (Autor (2014)). This is especially true among 
the first 90"" percentile of the income distribution: capital income is concentrated at the top 
of the income distribution and is the main driver of inequality between the poorest 90% and 
the richest 10% (Krueger et al. (2010)). The second reason why understanding wage gaps 
is necessary is that they inform us on how firms function. Indeed, how much firms agree 
to pay different types of workers depends on their needs: all other things equal, a firm is 
ready to pay much more a worker that possesses a set of skills crucial to its functioning than 
a worker who does not. The increase in the college wage premium (i.e. the average wage 
gap between workers who graduated from college and those who did not) between the late 
1970’s and early 2000’s in the United States, United Kingdom, and Canada as evidenced in 
Krueger et al. (2010), is convincingly interpreted by a large literature as the consequence 
of skill-biased technological change: a shift in firms’ production structure that made edu- 
cated workers more productive relative to uneducated workers (Katz and Murphy (1992), 
Acemoglu (1998), Goldin and Katz (2008), Autor et al. (2020)). More recently, the college 
wage premium trend has flattened, and it has even decreased in some European countries 
(for instance in Germany, Italy, Spain). This shift in trend is indicative of broad changes in 
the labor markets where it has occurred and could be either due to a change in demand that 
would go against skill-biased technological change or to a change in supply: both Europe 
and the United States have experienced large education expansions over the past 50 years. 
If supply changes are at play, even if firms’ demand for educated workers has persisted, it 
could be that their larger number is causing the decrease in relative wage. Understanding 
how supply and demand play out in determining wage gaps is central to economic policy- 


making and should shape education and production policies !. 


Being able to tell apart demand and supply effects’ on wage levels is at the heart of this 
dissertation. To do so, it develops a set of modeling and data tools that borrow from various 


branches of the economics literature. First, I use ordinary least square regressions in the 


'For instance, the French high school curriculum was reformed in-depth in 2019, leading to a strengthened 
specialization of students. Whether this will prove a strength for new graduates in the labor market remains 
to be assessed. 
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first chapter of this dissertation to assess the average impact of one or several explanatory 
variables on an outcome variable. Second, I rely on the structural econometric literature and 
the matching literature to build a novel model of matching between workers and firms in the 
labor market. The model can be either static, as in the second chapter, or dynamic with 
forward-looking agents, as in the third one. In both cases, the model incorporates workers 
who maximize their utilities, and firms who maximize their profits, with wages acting as 
equilibrium transfers. Importantly, both workers and firms are heterogeneous in multiple 
dimensions: education, age or occupation for workers, sectors or location for firms. Because 
workers perceive different amenities in different firms, and not all workers are equally pro- 
ductive in firms, the model is able to generate rich matching and wage distributions. Finally, 
I exploit high-quality matching data with rich information on which workers are employed 
at which firm and for what wage. Using this data, I am able to structurally estimate the 
aforementioned models using observed matching and wage distributions, which lets me back 


out the parameters of worker supply and firm demand. 


Related literature. This dissertation belongs to the large field of labor economics, which 
investigates workers’ and firms’ outcomes (wage, employment, etc.) on labor markets. It 
is related to several strands of the literature: first to the broad body of research on the 
mechanisms of wage setting, second to the theoretical matching literature, and third to the 
literature studying returns to education, and how they have changed over time and across 
economies. 

Investigating determinants of wage is central to the field of labor economics, and I give 
here only a partial view of the extensive corpus of research on the topic. The neoclassical 
view, as described by Hicks (1932), is that in the simple setup where a representative firm 
seeks to maximize profits by hiring its workforce, workers’ wages are equal to their marginal 
product in the firm. This simple model of wage setting can be augmented by accounting 
for workers’ diverse education or experience levels, as in Katz and Murphy (1992) or Card 
and Lemieux (2001), or by assuming inelastic labor supply, as Manning (2021) describes. 
An important alteration of this view, which took place in the 1970s and started the search 
literature (McCall (1970), Mortensen (1970)), is to acknowledge the existence of frictions in 
workers’ search for a job and firms’ search for an employee: a worker does not immediately 
find a job the moment she enters the labor market, and a firm cannot instantly hire whoever 
is needed. The search literature provides frictions as a rationale for inelastic labor supply, 
and they are usually modeled with a measure of randomness in the meeting process of 
workers and firms. Because agents are forward-looking, these frictions impact the setting 


of wage: when a worker and a firm meet they account for future search costs should they 
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refuse the present match. The first model to fully endogenize wage distribution in the search 
literature is the Mortensen-Diamond-Pissarides model (Mortensen (1982), Diamond (1982), 
Pissarides (1985)). It does so by decomposing the search into two steps: first workers and 
firms search for a good match, instead of simply having workers searching for a good wage. 
Wage is then bargained as a split of wage surplus. Another approach is wage posting, initially 
developed by Burdett and Mortensen (1998) is on-the-job search: even when they already 
have a job, workers keep looking for a potentially better one: this creates a job ladder with 
a full wage distribution. Both the wage bargaining and the wage posting views have since 
been extended and enriched since: Postel-Vinay and Robin (2002) augment wage posting 
with price discrimination from firms based on workers’ characteristics, Mortensen (2003) 
examines various wage-setting mechanisms in a model of on-the-job search, Cahuc et. al. 
(2006) augment the wage bargaining model with on-the-job search, accounting for firms’ 
offers and counter-offers. Another important contribution by Shimer and Smith (2000) 
introduces search frictions in the matching model of Becker (1973) and characterizes the 
assortativeness pf equilibrium. Shimer and Smith (2000)’s result has spawned a rich sub- 
literature in search-matching models: Lise et al. (2016) extend it to one-the-job search, 
Lise and Robin (2017) use the framework to understand how a non-)stationary environment 
impacts wage setting, Moscarini and Postel- Vinay (2018) emphasize the role of the job ladder 
created by frictions in amplifying aggregate shocks, Lise and Postel-Vinay (2020) considers 
multidimensional worker types and firms requirements. Another critical extension to the 
search model is directed search (Julien et al. (2000), Burdett et al. (2001), Menzio and 
Shi (2011)) that postulates that workers and firms are able to direct their search based on 
counter-parties characteristics, or in the case of wage posting models, workers direct their 
search depending on the wages posted by firms. See Hosios (1990) for an important link 
between wage bargaining models and efficiency in wage posting models. The directed search 
set up can be extended to simultaneous job applications, as in Albrecht et al. (2006), Kircher 
(2009) and Galenianos and Kircher (2009), or to job-to-job transitions (Garibaldi et al. 
(2016)), idiosyncratic risk (Schaal (2017)) or selective effort (Tsuyuhara (2016)). Directed 
search model are useful tools to understand wage rigidity (Menzio and Moen (2010)), business 
cycles (Menzio and Shi (2011)) or sorting (Eeckhout and Kircher (2011)). 

Parallel to the search structural literature, another view of wage-setting has developed, 
that does not focus on search frictions but instead on worker and firm heterogeneity, fol- 
lowing the seminal work by Abowd et al. (1999) (hereafter AKM), who empirically estimate 
the contribution of workers and firms to earnings dispersion. 'To do so, they use ‘mover’, 
workers who switch firms, which allows them to separately identify additive worker and firm 


fixed effects. They find that firms account for about 20% of the wage dispersion, so that 
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worker heterogeneity appears to be the main driver of the observed heterogeneity in wage. 
Following in their steps but with contrasting results, Woodcock (2008), Card et al. (2013) 
and Song et al. (2019) evidence the role of firm wage premium in the rising wage inequality. 
Importantly Abowd et al. (1999) rely on an exogenous mobility assumption, that is likely 
not verified in the data: workers’ movement between firms must be uncorrelated with their 
previous earnings. A large structural literature seeks to abstract from this assumption and 
also aims at capturing sorting, i.e. the combined effect of worker and firm heterogeneity on 
wage. It does so mainly by directly modeling sorting, as Hagedorn et al. (2017), Lopes de 
Melo (2018) and Bagger and Lentz (2019). Bonhomme et al. (2019) provide a comprehensive 
framework to reconcile the structural and empirical approach, which is used by Lentz et al. 
(2018) to evidence sorting patterns in Denmark. 

This dissertation shares the same goals as the aforementioned literature: to understand 
how wages are set in the labor market. The tools I used to explore this question are related 
to but different from the ones just described however: I choose to use matching models to 
describe labor markets. Instead of focusing on embedded frictions like the search literature 
does, or unobserved productivity, matching models put the spotlight on supply and demand 
effect, as well as worker and firm surplus derived from observable characteristics, such as 
education, experience, occupation or sector. 

The matching literature is split into two main strands: matching with transferable utility, 
for which a comprehensive theory is provided in Shapley and Shubik (1971) and matching 
with non-transferable utility in the framework of which Gale and Shapley (1962) developed 
the eponymous algorithm. The latter is well-suited to the analysis of problems such as medi- 
cal residency Roth (1984) or kidney exchange Ashlagi and Roth (2012). Menzel (2015) shows 
how to identify and estimate preference parameters on a large matching market with non- 
transferable utility. The former type of matching, with transferable utility, has first caught 
the attention of family economist with Becker (1973)’s seminal work on marriage markets. 
The union of marriage economics and matching theory has spawned a rich literature in family 
economics: since Choo and Siow (2006) proposed a structural model of matching between 
marriage partners with transferable utility to estimate partners’ utility parameters when 
marriage patterns are observed, many have followed suit: Dupuy and Galichon (2014) study 
mutual attraction between personality traits, Chiappori et al. (2017) develop a model with 
investment on children, Ciscato et al. (2020) compares patterns in homosexual and hetero- 
sexual marriages and Chiappori et al. (2020b), Chiappori et al. (2020a) survey the evolution 
of assortative matching in the US and the UK. Alternative approches to the Choo and Siow 
(2006)’s view include Chiappori et al. (2012) who simplify the parametrization of surplus 


to be able to estimate the model through reduced form equations, the approach developed 
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in Fox (2010b), Fox and Bajari (2013) Fox et al. (2018), that focuses on identifying the 
distribution of unobserved heterogeneity using several markets, and Sinha (2018), Gualdani 
and Sinha (2022) who aim their attention at identification of non-parametric models. Re- 
cently, the theory of matching with transferable utility has benefited from advances made 
in optimal transport (Villani (2009), Peyré and Cuturi (2020)). The link between matching 
and optimal transport is explicited in Galichon (2016) and Galichon (2021), and rests on 
Gretsky et al. (1992)’s duality proof. It can be summed up as a first welfare theorem: under 
a large market assumption, the equilibrium resulting from decentralized matching market 
with idiosyncratic heterogeneity is the same as the equilibrium obtained through solving a 
social planner problem that maximizes expected surplus with a penalty due to the idiosyn- 
cratic heterogeneity. It turns out this social planner problem is nothing different from a 
regularized optimal transport problem, and can be solved using techniques from this liter- 
ature (in most cases, the Sinkhorn algorithm). Also see Galichon and Salanié (2021) for a 
general exposition. On a related topic, the equivalence of matching markets and hedonic 
markets in the spirit of Ekeland et al. (2004) has been made by Chiappori et al. (2010), and 
the equivalence between matching and discrete choice a la Berry et al. (1995) is established 
in Bonnet et al. (2015). Matching with transferable utilities in the spirit of Choo and Siow 
(2006) is also used to understand labor markets: Dupuy and Galichon (2022) show how using 
matching and wage distribution separately identifies worker amenities and firm production. 
Dupuy et al. (2020) introduce a matching market with imperfectly transferable utility to 
account for taxation, Galichon and Hsieh (2018) develops a matching model with waiting 
queues, bringing the theory closer to search models, and Dupuy et al. (2021) study the job 
market of CEOs in Denmark. Another strand of the literature directly combines matching 
and search to understand sorting (Eeckhout and Kircher (2011)), building on the seminal 
work of Shimer and Smith (2000). Finally, labor markets are better understood through 
the prism of one-to-many matching, since the most common form of organization for a firm 
is to hire several workers. The first step towards one-to-many markets was first completed 
by Kelso and Crawford (1982) who propose a matching algorithm under the condition that 
workers are gross substitutes and Hatfield and Milgrom (2005) who develop matching with 
contracts with substitutable workers. Further development of one-to-many matching models 
was faced with a sizeable issue: without the restrictive gross-substitute condition, assignment 
stability could not be guaranteed. The problem has also been explored in auction theory by 
Bikhchandani and Ostroy (2002) and Vohra (2011), and has been recently solved for match- 
ing models through a large market assumption that allows for worker complementarity, see 
the recent survey by Azevedo and Hatfield (2018), and Che et al. (2019) on non-transferable 


one-to-many matching with worker complementarity. Recently, there has been a surge in 
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interest in how matching can explain labor market trends, for instance Lindenlaub (2017) 
explicit sorting on manual and cognitive skills, Choné and Kramarz (2021) study outsourcing 
of workers’ skills, and Boerma et al. (2021) examine a generalization of assortative matching 
on one-to-many markets. 

Finally, this dissertation is connected to the vast literature on the education wage pre- 
mium, that analyzes wage gaps between educated and uneducated workers. Many different 
angles and approaches constitute this literature, and I presently focus on two: the changes 
in skill/education wage premium, explained through skill-biased technological change, and 
life-cycle earning profiles, that focus on cumulative returns to education and experiences. 
The skill-biased technological change literature originally aims at explaining the increase 
in relative wages of skilled, or college-educated workers compared to their unskilled peers 
that occurred in the 1970s and 1980s in the United States. Its core idea is that production 
technology is changing at the time in a way that makes skilled workers increasingly more 
productive than unskilled workers, which, according to neoclassical wage setting, increases 
their wage compared to unskilled workers. Skill-biased technological change (SBTC) is first 
evidenced through estimation of the neoclassical firm production model on wage data by 
Bound and Johnson (1992), Katz and Murphy (1992), and then grounded into the growing 
computerization of the US economy (Krueger (1993), Acemoglu (1998), Autor et al. (1998)) 
and capital-skill complementarity Krusell et al. (2000). Acemoglu (2003), Bloom et al. 
(2016) also show how trade can induce skill-biased technological change. Writing on the 
consequences of SBTC Autor and Dorn (2013) show it not only stretches the wage structure 
between skilled and unskilled but also causes the polarization of the US economy, whereby 
low and high skilled occupations grow, while middle-skilled occupations decline. Polarization 
has also been shown to be caused by offshoring (Goos et al. (2014)) and structural change 
(Bardny and Siegel (2018)). After the wage structure stretch of the 1970s and 1980s, the 
education wage premium has flattened in the 1990s, which the SBTC hypothesis struggles 
to explain since computers continued to gain ground in that decade (Card and DiNardo 
(2002)). In European countries such as France (Verdugo (2014)) starting in the 1970s, the 
UK (Blundell et al. (2022)) in the 1990s, or Germany (Doepke and Gaetani (2020)) since the 
1980s the wage structure has condensed rather than expanded, which is explained through a 
combination of a supply effect, through education expansions, that overtakes the demand ef- 
fect from SBTC, and the role of labor market institutions and employment protection, which 
dampens the impact of SBTC. In the US, the reasons behind the flattening of the college 
wage premium are still debated, some authors arguing it is due to a decrease in demand for 
cognitive skills (Beaudry et al. (2015), Valletta (2016)), while others defend on the contrary 


that demand for cognitive skills has increased in recent years (Blair and Deming (2020)) 
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and that the Great Recession has accelerated SBTC (Hershbein and Kahn (2018)). Lindley 
and Machin (2016) nuance these views by showing that if the overall college wage premium 
stagnates, the postgraduate wage premium increases, suggesting SBTC now favors post- 
graduates. Deming (2017) also shows the rising returns to social instead of cognitive skills, 
and Deming and Noray (2020) focus on STEM works and exhibit the dampening of earning 
premium over time due to technology obsolescence. In a dynamic perspective, comparison 
of workers’ wage evolution over their lifetime by cohorts has evidenced a flattening of the 
life-cycle profile for recent cohorts (Manovskii and Kambourov (2005), Guvenen et al. (2017) 
Kong et al. (2018)), which is particularly strong for college graduates (Rothstein (2020), 
Ashworth et al. (2021)). The reasons behind the flattening of lifetime income are debated. 
Jeong et al. (2015) point to the effect of demographic changes. Another possible explanation 
is the scarring effect brought about by the Great Recession that started at the end of the 
2000s: a broad literature demonstrates the medium and long-term effects of exposure to 
bad economic conditions at the start of a career. Gregg and Tominey (2005) show youth 
exposure to unemployment carries a wage penalty twenty years later, Kahn (2010) focuses 
on college-educated workers and find persistent wage effects of graduating in a recession. 
In contrast, Brunner and Kuhn (2014) show blue-collar workers are penalized longer than 
white-collar workers because they have lower job mobility. Exploring the mechanisms behind 
the scarring effect, Liu et al. (2016) show skill mismatch between college graduates and firms 
increases during recessions, and explain career losses for ‘unlucky cohorts’. Berge (2018) 
shows the initial mismatch fades thanks to job mobility, which lets unlucky cohorts catch 
up on luckier ones. Exploring a different mechanism, Kwon et al. (2010) show the intensity 
of worker promotions is correlated with the business cycle, which explains more than half of 


wage cohort effects. 


The three strands of literature presented above all feed into the three chapters of this 
dissertation: chapter 1 empirically explores career mechanisms that result in different life- 
time income profiles by cohort, chapter 2 uses structural methods, namely matching models, 
that are particularly suited to understand supply and demand effects on the labor market 
at the aggregate level, to evaluate the simultaneous effect of an education expansion and 
technology change on wages, and chapter 3 develops a matching model to measure agents’ 
expectations of individual returns to changes in their type. A summary of each of the dis- 


sertation chapters follows. 


Chapter 1. The first chapter documents the flattening wage returns to experience for high 


education graduates in France between 1998 and 2017. I compare wage growth over the first 
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seven years in the labor market between three cohorts, who left school in 1998, 2004, and 
2010. I document average wage growth by cohort and education level and find lower-educated 
workers (high school dropouts and high school graduates) experience flat wage profiles in the 
early years of their careers. However, High education graduates display the steepest profile 
for wage growth, but I find it is flattening between the 1998 and the 2010 cohort. 

The French economy has undergone substantial changes over the period, as well as the 
French education system. As a result, the 1998, 2004, and 2010 cohorts enter the labor 
market under substantially different conditions: the 1998 cohort faces high (above 10%) 
unemployment, but counts relatively few high education graduates. The 2004 cohort enjoys 
low unemployment and high demand from firms but counts higher education graduates. 
Finally, the 2010 cohort enters the labor market amid the Great Recession, facing high 
unemployment and low growth. Because the French education expansion is still strong in 
the 2000s, encouraged by the creation of vocational bachelors and the implementation of the 
Bologna Process, the 2010 cohort counts substantially more high education graduates than 
its predecessors. This is likely to hurt their labor market prospects on several levels: first, 
they face reduced demand from firms. Second, high education graduates are more numerous 
than before. The impact of this second fact can be thought of from various perspectives: 
(Gaini et al. (2013); Dupray and Moullet (2010)). First, a degree could be a signal of 
individual productivity. An increase in the number of graduates then implies a decrease 
in their average individual quality, which can be reflected in a slower wage progression. A 
second approach considers the degree as a way of acquiring human capital. The diversification 
of the French higher education system, by modifying degrees’ content, may have negatively 
impacted the acquisition of young graduates’ human capital. Finally, a third approach based 
on the standard neoclassical model predicts a decline in the wages of young graduates if their 
number increases simply because the wage is equal to the marginal product of an additional 
worker. If firms exhibit decreasing returns to scale, every new worker lowers the average 
wage. 

I set out to empirically study the reasons for differentiated wage growth in France since 
the end of the 1990s. Tod o so, I use the French ‘Generations’ surveys made available by 
the CEREQ (Centre d’Etudes et de Recherche sur les Qualifications). The surveys are pre- 
sented as panel data and cover the working lives of school leavers in 1998, 2004, and 2010 for 
seven years, to provide a comprehensive overview of the integration of young people into the 
French labor market. I first decompose differences in average wage growth by occupation 
into an extensive and intensive margin. The extensive margin is driven by a composition 
effect resulting from differences in occupational shares between cohorts. The intensive mar- 


gin rests on the change in annual wage growth by occupation. Occupations who display a 
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negative intensive margin are also the ones who exhibit a large and positive extensive mar- 
gin. Indeed, occupations that know the most important slowdown in wage progression are 
also those which experience the greatest influx of graduates between 1998 and 2010. This is 
in line with a supply and demand interpretation of the wage growth slowdown, whereby an 
over-supply of new graduates prevents them from attaining their predecessors’ wage levels. 
This interpretation suggests exploring mechanisms through which an increase in graduates’ 
supply, along with stagnation in demand, impact early-career wage dynamics. I explore two 
mechanisms: promotion to managerial positions and degree-occupation mismatch. I show 
that obtaining a managerial position is accompanied by an increase in salary in the medium 
term. Hence, a decrease in the probability of obtaining such a position worsens the overall 
wage progression. This is consistent with findings by Kwon et al. (2010). I then examine the 
argument of Liu et al. (2016), who show that in the US, college graduates during the Great 
Recession suffered from a degraded degree-industry match, which led to persistently lower 
wage levels than their older peers. In France, I do not observe a worsening of mismatch 
(defined as the mean first-year wage level for a given degree major within a given occupa- 
tion) between 1998 and 2010, but I find that its importance in determining future wages has 


increased between the 1998 and 2010 generations. 


Chapter 2. Between the 1970s and today, many economies both in the developed and 
developing world have experienced an increase in their educated labor supply. As a result, 
the ratio of educated workers to uneducated workers present in labor markets has risen. This 
chapter seeks to understand the shift in labor supply education’s impact on worker and firm 
matching and wages with a novel model of matching on the labor market. The model is 
structurally estimated on Portuguese matched employer-employee data. In doing so, I am 
able to quantify the impact of supply and demand changes on worker-firm allocation and 
wage structure. 

To capture supply (from workers) and demand (from firms) mechanisms in the labor 
market, I build a static one-to-many matching model with transferable utility. Workers and 
firms differ with respect to their observed characteristics, which are summarized by a multi- 
dimensional type, as well as a stochastic shock that accounts for unobserved heterogeneity. 
A single firm matches with several workers, who constitute a bundle that forms its work- 
force. The surplus created by the match depends on the firms’ observable characteristics 
as well as the workforce. The utility is transferable under the form of wages paid by the 
firm to the workers in its workforce. Firms seek to maximize total profit, which is additive 
in the difference of production and total wage bill, plus random shocks. Workers maximize 


their utility, which is additive in amenities, wage, and a random shock. Amenities embody 
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workers’ inner preference for a given type of firm. At equilibrium, wages clear the market 
and each agent matches with their best option given wages. The model can generate a rich 
distribution of wages that depend both on workers’ and firms’ observable characteristics, 
as well as on the employed workforce. It also predicts equilibrium matching, which is the 
joint distribution of firms and workforces. Using both matching and wages, I can separately 
identify firm production from workers’ amenities. 

The framework offers more flexibility in estimation than classic supply and demand mod- 
els developed in Katz and Murphy (1992) and Card and Lemieux (2001): it identifies worker 
preferences in addition to firm production, as well as varying production parameters over 
time. This is because by explicitly modeling firms’ and workers’ match choices, I can use 
both observed matching and observed wages, which brings more power to identification. The 
model is fitted to the data by assuming parametric forms for firm production and workers’ 
amenities. I classify workers into two education levels, high school graduates and non- 
graduates, and three age groups, young, middle-aged, and senior. Firms are differentiated 
by their sector of activity. Following the literature, I choose a nested Constant Elasticity of 
Substitution (CES) function for production, with productivity parameters for each education 
level that vary between sectors. I assume worker preferences for firms depend on a worker’s 
age, education level, and firm sector. Equipped with model predictions for matching and 
wages, I structurally estimate the model on matched employer-employee data. I estimate the 
model by maximum likelihood on the joint distribution of matching and wages, separately 
every three years. 

The model developed in this chapter is related both to one-to-many assignment problems 
studied in mechanism design (Bikhchandani and Ostroy (2002), Vohra (2011)), and to one- 
to-one matching models used in family economics (Choo and Siow (2006)). This chapter 
bridges the gap between these two literatures: it extends one-sided assignments to two- 
sided matching, and generalizes one-to-one matching to one-to-many. Additionally, I extend 
the econometric framework of Choo and Siow (2006) and Galichon and Salanié (2021) to 
one-to-many matching. 

I use the novel theoretical framework developed to study the Portuguese labor market 
between 1987 and 2017. I highlight three facts on the Portuguese labor market: first, the 
country operates a vast education expansion over the period, which translates in a dramatic 
increase in the relative supply of high school graduates relative to non-graduates on the 
labor market. Second, the high school wage premium decreases over the period. The high 
school wage premium is defined as the wage gap between workers who graduated from 
high school, and those who did not. The decrease in wage premium is particularly stark 


among young workers. Two opposite interpretations could be given for this fact. It could 
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be the consequence of a trade effect: Portugal joined the European Union in 1986, and 
because it had relatively more uneducated workers (workers who did not go to high school) 
in its labor force than other EU countries, a Heckscher-Ohlin model of trade would predict 
an increase in the export of goods that require uneducated labor to produce. Relative 
demand of uneducated to educated labor would rise, and the high school wage premium 
would decrease. The other interpretation rests on a supply effect: even if relative demand 
of educated to uneducated labor increases, the formidable education expansion occurring 
in Portugal in the 1990s and 2000s could take over and cause a decline in the high school 
wage premium. The model described above can distinguish which of the two, trade effect 
or education expansion effect, actually happened. Third, I measure worker-firm sorting, 
which is defined as the relative number of high school graduates over non-graduates in an 
age group employed in a given sector. The distribution of high school graduates versus 
non-graduates across industry sectors becomes highly unbalanced, in favor of services, and 
transports and communications, who employ an increasing share of high school graduates. 
The former two facts imply relative supply of high school graduates over non-graduates 
has grown faster than firms’ relative demand for high school graduates over non-graduates. 
The latter suggests that sorting between workers and firms has evolved over the period: 
either because firms in services and transport and communications demand an increasing 
share of high school graduates, or because high school graduates’ preference for these firms 
strengthens. 

After estimating the model, I find that relative demand for high school graduates from 
firms in the Services, Manufacturing, and Transport & Communications sectors has increased 
dramatically over the period, starting in the early 2010s. This finding is in line with the 
skill-biased technological change hypothesis, rather than the trade hypothesis: it suggests 
an increase in the relative demand of educated to uneducated labor, that is outbalanced by 
the increase in relative supply. I also find that young and middle-aged high school gradu- 
ates’ preference for these industries has declined over time, while their share in production 
increases compared to senior workers. Compared to the classic supply and demand frame- 
work, the model offers two additional mechanisms whereby high school wages gaps stay large 
when a large number of high school-educated workers enter the labor market. First, a de- 
crease in workers’ amenities pressures wages upwards. Second, variation in young graduates’ 
share in production compared to more senior high school graduates increases firm demands 
for the former compared to the latter. I perform several counterfactual exercises to assess 
the separate actions of changes in workers’ demographics (both in education and age distri- 
bution), firm sector composition, firm demand through production parameters, and worker 


preferences, on sorting and wage premium. I find that changes in demographics are the main 
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positive drivers of changes in sorting. Changes in industry composition, firm demand, and 
worker preferences overall have a negative, if modest, effect on sorting. Wage premia by age 
group and industry are negatively affected by changes in worker demography and industry 
composition and positively affected by changes in firms’ demand. These suggest changes 
in relative productivity in favor of high school graduates have driven the high school wage 
premium up, but cannot compensate for the large increase in the relative supply of graduates 


versus non-graduates. 


Chapter 3. Co-authored with Jeremy Fox and Alfred Galichon. This chapter takes a differ- 
ent perspective from the two first chapters: instead of studying worker and firm matching 
in a static world, it asks how dynamic considerations influence matching. Indeed, on many 
matching markets, including the labor market but also the marriage market or startup ven- 
tures, agents account for the fact that their type may evolve in the future, either determin- 
istically (for instance, workers age) or depending on whom they matched with (if a worker 
is employed in a given occupation, she will accumulate human capital in that occupation). 
To understand how these considerations influence partner or employer/employee choices, we 
introduce a tractable model of one-to-one dynamic matching. Agents have individual types, 
such as education and experience for workers, and industry and occupation for jobs. When 
deciding with whom to match, agents account for future expected returns that stem from a 
change in type. In turn, this change in type will affect returns from future matches. Each 
period a matching market takes place, where wages act as market-clearing prices. The goal 
of this chapter is to develop a useful off-the-shelf model of repeated matching games from 
the theory literature that generalizes static matching games to a dynamic setting. It also 
differs from the two previous chapters because the model developed is not only applicable 
to labor questions but could also apply in family economics or industrial organization. The 
repeated matching game with econometric errors can best be explained as the combination 
of two touchstone papers in the literature: Choo and Siow (2006) proposes an estimator 
for static matching games with logit errors. Rust (1987) proposes an estimator for single 
agent, dynamic discrete choice models, often using logit errors. What we do in this chapter 
is combine the two to obtain an estimator for dynamic matching games. 

In our repeated matching framework, each agent has a state variable. Making a match, 
or remaining unmatched can affect the evolution of this agent state variable. Each period, 
agents participate in a matching market with prices or transfers for different matches. Given 
market-clearing prices, each agent selects the best partner in an, importantly, forward-looking 
manner. Next period the matching market reopens, new prices are stated and new matches 


form. A repeated matching game can have both individual and aggregate dynamics. At 
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the individual level, each agent is solving a single-agent dynamic programming problem, 
where each period the agent’s action is to choose a partner to match with. At the aggregate 
level, the state variable of the matching market is the active agents’ current set of types or 
state variables. This aggregate state variable evolves with the decisions of the individual 
agents and is summed up by a social planner Bellman equation. We first develop the model 
without econometric errors in surplus and then account for individual preferences under the 
form of econometric shock. In both cases, we explore two different methods to compute the 
aggregate equilibrium: one method rests on value function iteration on a grid to compute 
the social planner’s value function, and the associated equilibrium on each point of the 
grid, and the second method uses a deep learning model to approximate a fixed point to 
the social planner’s Bellman equation. One of our most important theoretical results is 
that a stationary equilibrium exists, both with and without econometric shocks: there is a 
mass of agent state variables such that, after optimal matches are chosen by forward-looking 
agents, the same masses of agent types occur. The existence of a stationary equilibrium 
does not depend on model parameters and lets the researcher optionally ignore aggregate 
dynamics by imposing that the matching game is at a stationary equilibrium. Focusing on 
the stationary equilibrium, we introduce yet two other methods to compute it: One method 
solves a system of nonlinear equations using a nonlinear programming solver. The second 
method reformulates the problem of finding a stationary equilibrium as a min-max problem 
and uses the Chambolle- Pock primal-dual algorithm to solve it. We show that both these 
methods can scale to problems with many agent types. In addition to computing a stationary 
equilibrium, we can extend the same estimators to structurally estimate parameters in the 
production of a match with an appropriate dataset. We then estimate geographic mobility 
costs for Swedish engineers in the 1970, and find that mobility imposes a sizeable penalty 


on production surplus. 
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CHAPTER 1 


The Flattening Returns to Experience for Higher 


Education Graduates in France 
An Occupational Analysis 


Abstract 


This chapter documents the falling wage returns to experience between cohorts entering 
the French labor market in 1998 and 2010, as experienced by higher education graduates 
in their early careers. Returns remain stable for high school dropouts and high school 
graduates. I decompose differences in average wage growth by occupation into an 
extensive and intensive margin. The extensive margin is driven by a composition effect 
resulting from differences in occupational shares between cohorts. The intensive margin 
rests on the change in annual wage growth by occupation. Occupations that display a 
negative intensive margin are also the ones that exhibit a large and positive extensive 
margin. This finding is consistent with decreasing returns to each new graduate in a 
given occupation. I then study two mechanisms behind the wage growth slow down: 
access to managerial positions and impact of initial match quality. I find access to 
managerial positions is more infrequent for recent cohorts. I also find that initial 
match quality has not worsened between the 1998 and 2010 cohorts, but its impact on 


future wages has become more enduring. 
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1 Introduction 


Wage growth on workers’ career paths, or returns to experience, have been the subject 
of a substantial economic literature, especially since the recession that affected the world 
economy in the late 2000s and early 2010s. A subset of this literature, to which this chapter 
belongs, focuses on individuals’ early careers and compares returns to experience by cohort, 
or generation, i.e. a set of individuals who entered the labor market at the same time. In this 
chapter, I compare the wage growth between three cohorts, who entered the French labor 
market in 1998, 2004, and 2010, over their first seven years of career. I document average 
wage growth by education level and find lower-educated workers (high school dropouts and 
high school graduates) experience flat wage profiles in the early years of their careers. High 
education graduates display the steepest profile for wage growth, but I find it is flattening 
between the 1998 and the 2010 cohorts. 


Over the period studied in this article, from the end of the 1990s to the second half of 
the 2010s, the French economy has undergone two recessions at the beginning and end of 
the 2000s: the high unemployment rate at the beginning of the period (above 10% of the 
total active population) fell sharply during the 2000s before rising again from 2008 onwards 
(without, however, reaching its previous level). GDP grew steadily throughout the period, 
except in 2008 and 2009. At the same time, the French education system evolved in the early 
2000s, as vocational bachelors were created and the Bologna process was implemented. The 
latter reform reorganized the French higher education system into three levels: Bachelor (3 
years), Masters (2 years), and Doctorate (3 years). Both reforms contributed to multiplying 


the number of university graduates entering the labor market and diversified their profiles. 


Two mechanisms are at work between 1998 when the first cohort studied enters the labor 
market, and 2017, the last year the third cohort is observed: demand, reflected in job oppor- 
tunities that vary according to firms’ needs, and supply, since the population’s educational 
composition changes between 1998 and 2017, with an increase in the share of higher educated 
workers. On the supply side, the impact of the increase in higher education graduates on 
wage levels can be thought of from various perspectives (Gaini et al. (2013); Dupray and 
Moullet (2010)). First, a degree could be a signal of individual quality which is unobserved 
by the analyst but observed by the employer, who adjusts the wage accordingly. If we assume 
that the unobserved quality is distributed in the same way among each cohort, an increase in 
the number of graduates implies a decrease in their average unobserved quality, which can be 


reflected in a slower wage progression. In the US, the decrease in graduates’ quality due to an 
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education expansion is evidenced in Carneiro and Lee (2011). A second approach considers 
the degree as a means of acquiring human capital: in general, high human capital would 
imply high wages, because human capital makes workers more productive. If the level of 
human capital conferred by the degree does not change, wages should not change, regardless 
of the number of graduates. But the diversification of French higher education, by modifying 
degrees’ content, may have negatively impacted the acquisition of human capital of young 
graduates. Finally, a third approach based on the standard neoclassical model and developed 
by Katz and Murphy (1992) and Card and Lemieux (2001) predicts a decline in the wages 
of young graduates if their number increases without any change in demand, even if neither 
the unobserved quality nor the content of the degree changes. This approach postulates that 
the wage of an employee is simply on his or her marginal product. If firms produce with 
decreasing returns to scale, an additional employee has a smaller marginal product than the 
employees that were hired before she was. In a context of a strong increase in the number of 
graduates on the labor market without a comparable increase in demand for the most highly 
educated, the diminishing marginal returns approach therefore anticipates lower wages for 
the latest arrivals, i.e. young graduates. This last approach could also impact the demand 


side: if firms’ demand for young high educated workers drops, it negatively affects their wage. 


This chapter sets out to empirically study the reasons for differentiated wage growth in 
France since the end of the 1990s. I first document early career wage progression in France 
between 1998 and 2017 and show that it changes differently depending on education level 
and occupations. Indeed, occupations that know the most important slowdown in wage pro- 
gression are also those which experience the greatest influx of graduates between 1998 and 
2010. This is in line with a supply and demand interpretation of the wage growth slowdown, 
whereby an over-supply of new graduates prevents them from attaining their predecessors’ 
wage levels. This interpretation suggests exploring mechanisms through which an increase 
in graduates’ supply, along with stagnation in demand, impact early-career wage dynam- 
ics. I explore two mechanisms: promotion to managerial positions and degree-occupation 
mismatch. I show that obtaining a managerial position is accompanied by an increase in 
salary in the medium term. Hence, a decrease in the probability of obtaining such a position 
worsens the overall wage progression. This is consistent with findings by Kwon et al. (2010). 
I then examine the argument of Liu et al. (2016), who show that in the US, college graduates 
during the Great Recession suffered from a degraded degree-industry match, which led to 
persistently lower wage levels than their older peers. In France, I do not observe a worsening 
of mismatch (defined as the mean first-year wage level for a given degree major within a 


given occupation) between 1998 and 2010, but I find that its importance in determining 
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future wages has increased between the 1998 and 2010 generations. 


I use the French ‘Generations’ surveys made available by the CEREQ (Centre d’Etudes 
et de Recherche sur les Qualifications). The surveys are presented as panel data and cover 
the working lives of school leavers in 1998, 2004, and 2010 for seven years, to provide a com- 
prehensive overview of the integration of young people into the French labor market. The 
surveys show that the 2010 cohort (defined by individuals who left the education system in 
2010, regardless of their age) has experienced a more difficult situation than its predecessors: 
three years after they entered working life, their unemployment rate was 22%, compared to 
11% for the 1998 generation in 2001 (Epiphane et al. (2019)). The median wage in the first 
year on the labor market is higher for the 2010 cohort than for the 1998 cohort: 1265 ver- 
sus 1090 in constant euros, base 2015. However, the 2010 cohort experiences a slower wage 
growth than the 1998 cohort: after seven years median wages are 1510 and 1500, respectively. 
Besides, the median wage after seven years is higher for the 1998 cohort than the 2010 cohort 
for higher education graduates, indicating a strong slowdown in salary progression for the 
highly educated. 


Related literature. The present analysis relates to several literatures. First, there 
exists a wide literature on wage inequality by education level, that usually attributes the in- 
creasing education wage premium of the 1070s and 1980s to skill-biased technological change 
(Bound and Johnson (1992), Katz and Murphy (1992), Card and Lemieux (2001)). The sub- 
sequent flattening of the education wage premium in the 1990s and 2000s (Card and DiNardo 
(2002)) is either explained through education expansions experiences in Europe (see Verdugo 
(2014) on France and Blundell et al. (2022) on the UK), on employment protections that 
dampen skill-biased technological change (Doepke and Gaetani (2020) in Germany), or to a 
simple reversal in firms demand for cognitive skill (Beaudry et al. (2015), Valletta (2016)). 
On the opposite side of the spectrum Blair and Deming (2020) and Hershbein and Kahn 
(2018) argue the Great Recession has accelerated skill-biased technological change rather 
than slown it down. Finally Lindley and McIntosh (2015) and Lindley and Machin (2016) 
evidence that if the wage structure has compressed overall, wage inequality within college 
graduates has increased depending on graduates’ chosen major and postgraduates studies. 
Second, several papers have found evidence of a flattening life-cycle wage profile: Manovskii 
and Kambourov (2005) document deflating life-cycle earnings for men in the US since the 
1960s, which Jeong et al. (2015) explain entirely through the demographic changes (i.e. the 
worker supply side) that occurred over the period. Similarly, Guvenen et al. (2017) document 


stagflation of men’s lifetime income in the US and Kong et al. (2018) show that the labor 
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earnings ‘multiplier’ between 25 and 55 has decreased from 4 to 2.6 between 1940 and 1980 
for college-educated workers in the US. To explain this fact, they build a model of schooling 
and human capital accumulation on the job and find that the decrease is entirely due to the 
increasing price of skill: because high skilled workers are more demanded by firms, college 
enrolment has increased, which drove up levels of human capital at career start, resulting in 
slower earning growth afterward. Over the shorter term in France, I find no evidence of this 
mechanism: average starting wages for high education graduates in 1998, 2004, and 2010 
are the same, which indicates younger cohorts do not have a human capital advantage over 
older ones. In related work, Ashworth et al. (2021) evidence the decrease in men’s wage 
growth between 1979 and 1997 in the US, especially for college graduates. Their model of 
schooling and work decision points to composition effects both in terms of observable and 
unobservable skills to explain variation across cohorts. Third, this chapter relates to the 
literature on labor market entry conditions and their impact on workers’ career trajectories 
and medium and long-term earnings (see von Wachter (2020) for a complete survey): Gregg 
and Tominey (2005) finds that exposure to unemployment still carries a wage penalty for 
workers after 20 years. Stevens (2008) attenuate this finding by showing that labor market 
outcomes of low and medium-skilled workers are not very vulnerable to economic conditions 
at the start of the career. Focusing on college graduates in the US Oreopoulos et al. (2012) 
show negative and persistent effects graduating in a recession on wage. They also show 
that workers partially recover through mobility towards better-paying firms. This finding 
is related to Liu et al. (2016)’s work on mismatch in the US: they show that in a reces- 
sion mismatch between college majors and firms’ sectors increases, which drives wages down 
and creates a persistence in the decrease. Job mobility allows to dampen the effect over 
time through new matches with better firms( Fredriksson et al. (2018), Berge (2018)). In 
this chapter, I find on the contrary that in France mismatch does not worsen for younger 
cohorts, but its weight on future wage determination intensifies. Job mobility is not suffi- 
cient to alleviate this intensification. Brunner and Kuhn (2014) differentiate their analysis 
by socio-professional category in Austria and show that the recession particularly affected 
blue-collar workers, as they are stuck in low-quality jobs longer than white-collar workers. I 
also take an occupational angle in this chapter, but do not find white-collar occupations to 
be less affected than blue-collar occupations: rather, I find that the occupations that suffer 
the most from deteriorated wage growth are the ones that experience the largest influx of 
new graduates. Not all work finds long and persistent effects on entering the labor market 
in a recession: for instance Berge and Brouwers (2017) shows the wage penalty lasts about 
fours years for high education graduates in the Netherlands. Finally, this analysis is closely 


related to Rothstein (2020) who shows wages decline in the short and medium-term in the 
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US, starting for cohorts who enter the labor market in 2005: I find a similar trend in France 
since the middle cohort I observe, who enters the labor market in 2004, already displays 
deteriorated wage growth for high education graduates compared to the 1998 cohort, even 


if it enters the labor market in times of good economic conditions. 


Section 2 describes the economic context in which the 1998, 2004, and 2010 cohorts 
entered, as well as the data from the Generations Surveys and the main variables of interest. 
I also introduce the empirical framework. Section 3 presents a decomposition of wage growth 
by socio-professional category. In section 4, I present two mechanisms of the slowdown in 
wage growth among the most highly educated. Finally, section 5 presents robustness tests 


and section 6 concludes. 


2 Data and Empirical Strategy 


2.1 The French labor market between 1998 and 2017 


Wage changes examined in this article are part of general trends on the French labour 
market between 1998 and 2017. Table 1.1 uses INSEE census data to provide a general 
overview of changes in the composition of the educational levels and occupations of the 
working population between 1999 and 2011. On the supply side, the share of higher edu- 
cation graduates in 1999 was 24.6% of the working age population. In 2011, this share is 
36.4%, a gain of almost 3 million individuals. The share of individuals with a high school 
degree has also increased, but to a lesser extent. The evolution of demand for each education 
levels is more difficult to assess, and I approximate it by the share of each occupation in the 
general population. The occupations whose numbers increased the most between 1998 and 
2011 are managers and higher intellectual occupations (MHIO) and intermediate occupa- 
tions (IO). Employees and craftsmen, shopkeepers and business owners saw their numbers 
stagnate, while they decreased for farmers and plant workers. In 1999, the MHIO positions 
were mainly occupied by higher education graduates (76.3%), and this share increased in 
2011 (82%). However, the share of higher education graduates in intermediate occupations 
has also increased, from 43.7% to 55.1%. In absolute terms, this increase even surpasses 
that of the MHIOs: 1,026 thousand individuals compared to 920. The strong link between 
tertiary graduates and MHIOs in 1999 has thus been weakened in favour of IOs in 2011. 
Three mechanisms may jointly explain this evolution: firstly, the nature of the tasks re- 
quired within the MHIOs and IOs may have changed. The literature on job polarisation in 
France (Albertini et al. 2017; Patel 2020) associates MHIOs with abstract tasks, and IOs 
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with routine tasks, with higher education graduates being most suited to abstract tasks. If 
it is the case that task content required in IOs positions tends towards more abstraction, the 
demand for higher education graduates should increase within this occupation. Secondly, it 
may be that supply of higher education graduates in 2011 is above demand from MHIOs, 
pushing them towards intermediate occupations by default. Finally, either the content of 
higher education degrees, or the graduates themselves may have changed between 1999 and 
2011, and higher education graduates who entered the labour market between 1999 and 2011 
are more productive at performing routine than abstract tasks. Understanding how the type 
of tasks associated with different occupations has evolved over time is already a focus in the 


polarization literature, hence I choose to explore the last two mechanisms in this paper. 


Table 1.1: Education Levels by Occupations within French active population in 1999 and 
2011 


1999 2011 Difference 


Occupation Nb (k) % HS %HE|/Nb(k) % HS %HE|HS(k) HE(k) 
Farmers 532 16.1 7.5 344 28.3 18.4 2 23 


Craftmen, retailers, | 1 407 15.0 14.9 1367 821.4 23.9 82 116 
business owners 
Top managers, | 2802 10.6 76.3 3 726 9.2 82.0 47 920 
highly qualified 


professionals 

Mid-level managers | 5100 = 21.8 43.7 5905 = 21.1 55.1 134 1 026 
Employees 6587 =16.7 10.5 6522 24.8 20.5 516 646 

Factory workers 5 827 6.2 Deh 5 162 15.1 6.8 418 193 

Total 22255 814.2 24.6 | 23026 19.0 36.4 | 1210 2925 


Nb (k): Number of individuals in thousands 
HS: High School degree, HE: Higher Education degree 


Lastly, changes in unemployment rate also reveals disparity between supply and demand. 
Public data from INSEE (Institut National de la Statistique et des Etudes Economiques) 
on unemployment rate by level of education show a systematically higher rate for high 
school graduates than for higher education graduates. Moreover, the unemployment rate 
increased between 1997 and 2017 for non-graduates, from 14% to around 17%, and high 
school graduates (around 12%), while it decreased for higher education graduates (from 7.5% 


to around 4.5%). This observation is consistent with an adjustment by unemployment among 
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the less educated during a fall in demand, as the minimum wage prevents any adjustment of 
wages (Gaini, Leduc, and Vicard 2013). For higher education graduates, on the other hand, 
any friction between supply and demand is rather reflected in wage levels rather than in the 


unemployment rate. 


2.2 The data 


The Generations Surveys are presented in the form of a panel: each observation corre- 
sponds to the activity of an individual (employment or unemployment) over a given period, 
called a ‘sequence’ (or spell). CEREQ conducts its surveys on a given cohort every two 
or three years. For instance, the 2010 cohort is surveyed in 2013, 2015 and 2017. Only 
individuals who responded to all three surveys are considered here. The three surveys are 
unequal in terms of the number of individuals surveyed: there are twice as many individuals 
surveyed from 1998 versus the 2010 cohort. To account for these differences, and any selec- 
tion effect that may arise from attrition, the Generation Surveys provide the analyst with 
a weighting per individual so that each survey is representative of the population of young 
French workers. I adapt this weighting in two ways: first, I normalise it so that each of the 
generations 1998, 2004 and 2010 has the same weight. Second, since the data are presented 
as individual-spell observations, individuals who change spell frequently is greatly increased 
in the analysis (interim workers for instance). To avoid this giving them too much weight, I 
weigh spells of individuals who change status several times a year according to spell length. 


The entire analysis will be weighted by these modified weights. 


Table 1.2: Number of individuals and spells by cohort 


| Gen 1998 Gen 2004 Gen 2010 
Number of individuals 13:673 9 633 7 500 
Number of spells 63 965 45 348 34 730 
Number of employment spells | 27 618 21 576 15 533 


The analysis focuses on employment spells and starting (or entry) wage obtained by 
young workers hired at the beginning of these spells, the changes in which is compared be- 
tween the 1998, 2004 and 2010 cohort by level of education. Each individual first spell starts 
the month after graduation or after they left school if they did not graduate. The surveys 
also provide the last wage obtained at the end of each spell, but no intermediary wage. I 
choose to focus on entry wage, because this is invariant to the duration of the employment 


spell. Using the INSEE consumer price index series, | compute wages in constant prices in 


48 


euro 2017. 


I exclude from the analysis spells in which individuals are under 16 years old, as well as 
the employment spells for which the monthly starting wage is less than € 200 or more than 
€ 20,000. The analysis focuses on job spell for which the location (at the ‘department’ level), 
firm sector and occupation are known. I consider only sequences in metropolitan France, 


between year 1 and year 8 of each cohort. 


The main characteristics of the individuals are described in Table 1.3: there are no major 
differences between cohorts in terms of the average age just after leaving the education 


system, the gender distribution, or the average number of spells after seven years. 


Table 1.3: Age, gender, and individual number of spells by cohorts 


| Gen 1998 Gen 2004 Gen 2010 
Average age at entry on labor market 21:36: 21,2 21.3 
% Men 0.51 0.53 0.51 
Average number of spells 4.9 5.3 5.1 
Average number of employment spells yaa 2.4 2 


I consider two main dimensions of individuals and their employment: educational at- 
tainment and occupation. I group individuals into four education levels: no degree (left the 
education system without finishing secondary school), secondary education (obtained either 
a general high school degree, or a vocational degree), short higher education (obtained a 
degree in less than four years, either a bachelor or a technical degree), and long higher edu- 
cation (obtained a degree in more than four years, either masters or PhD). Table 1.4 presents 
the composition of each cohort by level of education: the proportion of long higher education 
graduates (more than four years of higher education) is greater in the 2010 generation than 
in the 2004 and 1998 cohorts, while the proportions of short tertiary graduates (between one 
and three years of higher education) and secondary school graduates (CAP, BEP or Bac- 
calauréat) are lower. The proportion of individuals without a diploma (having left school 
with a brevet level) is higher in the 2010 cohort. The Generations surveys therefore show a 


polarisation of educational provision between 1998 and 2010. 
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Table 1.4: Education level shares by cohort 


Education level (%) | Gen 1998 Gen 2004 Gen 2010 
No degree 8.9 7.9 V2 
High school degree 52.3 53 42.7 
Short higher education degree 28.1 27.6 23.3 
Long higher education degree 10.7 11.5 16.9 
Total 100 100 100 


Table 1.5 shows the decomposition of occupations for first job by cohort. The share of 
managers and upper occupations increased between the 1998 and 2010 cohorts. The share of 
intermediate occupations has also increased, at a faster pace. As in the general population, 
the share of blue-collar workers has decreased, and the share of white-collar workers has 
stagnated. Because farmers represent too small a share of the employment spell, these spells 


are excluded from the rest of the analysis. 


Table 1.5: Occupation shares by cohort 


Occupation (%) | Gen 1998 Gen 2004 Gen 2010 
Farmers 0.7 0.2 - 
Craftmen, retailers, business owners 1.1 0.7 0.1 
Top managers, highly qualified professionals 12.1 11.4 18.6 
Mid-level managers 25.2 29.9 30.3 
Employees 28.2 26.9 26.4 
Factory workers 32.7 30.9 24.6 
Total 100 100 100 


2.3 Strategy 


Figure 1.1 shows the evolution of average starting wage by cohort and education level 
reported by individuals on the Generation Surveys. Education levels are grouped into four 
categories: no degree (individuals who left school at brevet level), secondary degree, short 
tertiary diploma (graduates of a higher education degree in three years or less), and long 
tertiary degree (graduates of a higher education degree in four or more years). This graph 
shows no significant difference in how starting wages growth over time between cohorts for 
three out of four levels of education: non-graduates, secondary school graduates and short 


tertiary graduates. On the other hand, long-term tertiary graduates’ entry wage growth 
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differs between cohorts: the 2010 cohort experiences a slower growth than the 2004 and 1998 
cohorts. This slowdown becomes more pronounced over time: the three cohorts begin their 
working lives with similar entry wage, and then diverge. While the 1998 cohort enjoys a 
significant average increase in starting wage from their second year on the labour market, 
the 2004 cohort’s average starting wage only really increased after three years on the labour 
market, and that of the 2010 generation after four years. The result is that the 2010 gener- 
ation is significantly behind its predecessors, a gap that persists beyond the 2010-2012 crisis 
period (years 1 to 3 for the 2010 generation), without any catching up taking place in the 


subsequent years available in the survey. 
Figure 1.1: Average entry wage over time in euros, by cohort and education level 


No degree High school degree 


2 4 6 8 2 4 6 8 
Years spent on labor market Years spent on labor market 
Short higher educ. degree Long higher educ. degree 


Years spent on labor market Years spent on labor market 


Cohort + 1998 + 2004 = 2010 


To understand the reasons behind the divergence observed in Figure 1.1, I use the follow- 
ing framework: individual i enters employment contract 7 = J(i,t) in year t. Each contract j 
is characterised by the characteristics of the firm, such as the industry or region, but also by 
features specific to the individual’s role in the company, such as occupation. The individual 
also displays specific characteristics such as cohort or level of education. Entry monthly wage 
under contract j in year t is w,;;. The regression below allows to decompose the evolution 
of average entry wages by cohorts and level of education, considering possible differences in 


the characteristics of contracts or individuals: 
log wijt = > ig aagaai eg Re OE ee Sa (1.1) 
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where log w;; is the logarithm applied to entry wage, a; is the number of years since 
leaving the education system (between 1 and 8), e; is a fixed effect for education, g; a fixed 
effect for the individual gender, 1; is a fixed effect for region and s,; for industry within which 


the contract takes place. 


The estimator {-, Is computed by education level e and cohort g. It measures the average 
increase in entry wages per year, for each cohort and education level, controlling for varia- 
tions in gender, industry, and region. Comparison of estimators between cohorts is based on 
the following identification assumption: the distribution of unobserved heterogeneity is the 


same for all generations. This assumption will be maintained for the rest of the analysis. 


One way of approaching the variations of 6-, between cohorts in a context of changing 
supply and demand is to decompose average entry wage growth not only by cohort and level 
of education, but also by occupation. In fact, by highlighting the heterogeneity of wage 
growth by occupations, I can identified two margins of divergence: an extensive margin and 
an intensive margin. The extensive margin highlights the variations in the share of new 
hires in each occupation, keeping wage evolution constant. The intensive margin focuses on 
variations in wage levels, holding constant the share of each occupation in new hires. This 


margin decomposition proceeds in two steps: first, define w?.,, average entry wage cleaned 


agt? 
fixed effects in the previous regression: 


log Wee = S° Tieshs=6) S° Divaie=aGe4 X At + Ex jt: (1.2) 


g € 
The second step is to project log wy, onto the space of education and occupation by 


estimating the following regression by cohort: 


log Wis = S- TV edue;=e] S- 1 foce;=p] Yeap X at + Exjt- (13) 
e 9 


The following decomposition is then carried out, for a given level of education e : 


Be,2010 = Be,1998 a y Me,2010,p * Ye,2010,p — y Ne,1998,p * Ye,1998,p> (1.4) 
Pp p 


where Ne,1998,p Ct Ne2010,» are the respective proportions of each occupation p within the 
education level e and the 1998 and 2010 generations. Introducing the cross term }},, Ne,2010,p X 


4e,1998,p we obtain: 


52 


Be,2010 — Be,1998 = S° (%e,2010,p — Ne,1998,p) X Ye,1998,p 


: : ; (1.5) 
_ > Me,1998,p * (Ve,2010,p =; Ve,1998,p) . 
p 


The first term (Ne 2010,» — Ne,1998,p) X Ye,1998,» Corresponds to the extensive margin: the 
share of the change in the slope of entry wage growth due to changes in share of occupation p 
within new hires. The second term n¢,2010,p X (Ve,2010,p — Ye,1998,p) iS an intensive margin: the 
share of the change in the slope strictly due to the change in the slope for specific occupation 
p, holding constant the share of each occupation in new hires. This decomposition seeks to 
separate a pure demand or composition effect (changes in the occupation of new hires between 
cohorts, i.e. the extensive margin) from a supply and demand equilibrium effect (changes in 


the distribution of education levels within individuals, captured by the intensive margin). 


3 Results 


Estimation results for equation (1.1) are presented in Table 1.6. Coefficients for entry- 
level wage growth are significant for all cohorts and levels of education. Hiring wages of 
individuals with no degree and high school graduates increase slightly during the first seven 
years on the labor market for all generations (about between 1.9% and 3.7% per year). Both 
short and long higher education graduates experience a more sustained growth in wage, but 
it is less pronounced for the 2004 and 2010 cohorts than the 1998 cohort (3.7% annual growth 
compared to 2.3% for short higher education graduates and 4.6% compared to 2.4% for long 
higher education graduates for 1998 and 2010 cohorts). Long higher education graduates 
suffer most from the slowdown in wage growth: the 2010 cohort’s growth loses almost half 
of its 1998 predecessors’ growth. The rest of the analysis will therefore focus on long higher 


education graduates, although results are presented for all levels of education. 
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Table 1.6: Log entry wage regressed on number of years spent on the labor market by 
education level, with gender, location and industry fixed effects 


log entry wage 
Gen 1998 Gen 2004 Gen 2010 


G0o7 0.0387" —qoa7 
(0.003) (0.003) (0.003) 
0.026* 0.019%" —0.03*** 
(0.001) (0.001) (0.002) 
Years x Short higher educ. 0.037" ~——-0.025*** ~——-0.023*** 


Years x No degree 


Years x High school degree 


degree (0.002) (0.002) (0.002) 
Years x Long higher educ. 0.046*** 0.045*** 0.024*** 
degree (0.003) (0.003) (0.003) 
FE education v v v 
FE gender v v v 
FE location v v v 
FE industry v v v 
Observations 37 785 27 656 20 130 
Re 0.325 0.244 0.283 


*p<0.1; **p<0.05; ***p<0.01 


All occupations except Farmers 


Estimation results for equation (1.3) are presented in Table 1.7 by cohort, for all levels 
of education and all occupations, except farmers, because their number is not large enough 
to obtain a robust estimate. Wage growth heterogeneity between the 1998 and 2010 cohorts 
by occupation is clearly apparent for higher education graduates. In particular, mid-level 
managers and top-level managers and highly qualified professionals are particularly affected 


by the slowdown in wage growth. 


Table 1.7: Log entry wage regressed on number of years spent on the labor market by 
education level and occupation, with gender, location and sector fixed effects 


log entry wage cleaned of fixed effects 
Gen 1998 Gen 2004 Gen 2010 


Years <x No degree x Craftmen, 0.075*** 0.068*** 


Shopkeepers, Business owners 
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(0.008) (0.012) () 
Years x No degree x Top 0.064*** 0.078*** 0.08*** 
managers, Highly qualified prof. (0.017) (0.021) (0.016) 
Years x No degree x Employees el te 
(0.002) (0.003) (0.002) 
Years <x No degree x Factory 0.03*** Oat 0.037*** 
workers (0.001) (0.002) (0.002) 
Years x No degree x Mid-level 0.022*** 0.043*** 0.037*** 
managers (0.005) (0.005) (0.003) 
Years x HS deg. x Craftmen, 0.012*** 0.056*** 0.022 
Shopkeepers, Business owners (0.003) (0.004) (0.024) 
Years x HS deg. x Top 0.074*** 0.069*** 0.0638*** 
managers, Highly qualified prof. (0.003) (0.004) (0.005) 
Years x HS deg. x Employees ae Bee 
(0.001) (0.001) (0.001) 
Years x HS deg. x Factory 0.025***  0.018*** 0.026*** 
workers (0.001) (0.001) (0.001) 
Years x HS deg. x Mid-level 0.043*** 0.031*** 0.041*** 
managers (0.001) (0.001) (0.002) 
Years x SHE. deg. x Craftmen, -0.001 Q,041"" 0.069 
Shopkeepers, Business owners (0.006) (0.009) (0.058) 
Years x SHE. deg. x Top 0.082*** 0.065*** 0.065*** 
managers, Highly qualified prof. (0.002) (0.003) (0.004) 
Years x SHE. deg. x Employees re ane ye 
(0.002) (0.002) (0.003) 
Years x SHE. deg. x Factory -0.013*** -0.003 -0.001 
workers (0.003) (0.003) (0.004) 
Years x SHE. deg. x Mid-level 0.043*** 0.034*** 0.029*** 
managers (0.001) (0.001) (0.002) 
Years x LHE deg. x Craftmen, 0.057*** 0.048** M079" 
Shopkeepers, Business owners (0.009) (0.019) (0.035) 
Years x LHE deg. x Top 0.058*** 0.067*** 0.047*** 
managers, Highly qualified prof. (0.002) (0.002) (0.002) 
Years x LHE deg. x Employees es ee ppt 
(0.007) (0.006) (0.005) 
Years x LHE deg. x Factory -0.045* -0.049*** -0.054*** 


workers 
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(0.023) (0.014) (0.009) 


Years x LHE deg. x Mid-level 0.002 0.003 -0.011*** 
managers (0.004) (0.003) (0.003) 
Observations 37 785 27 656 20 130 
R? 0.182 0.139 0.155 


*p<0.1; **p<0.05; ***p<0.01 

Cells are empty where there were too few observations 

All occupations except Farmers 

HS: High School, SHE: Short higher educ., LHE: Long higher educ. 


Figure 1.2 presents the results of equation (1.5)’s decomposition by level of education. It 
shows that the slowdown in wage growth for higher education graduates between the 1998 
and 2010 cohorts is mainly driven by mid-level managers and top-level managers and highly 
qualified professionals, who account for almost 100% of the total margin for both short and 
long higher education graduates. However, the extensive margin of mid-level managers be- 
haves differently from that of top-level managers and highly qualified professionals: it is 
positive for the former, indicating an increase in the share of long higher education grad- 
uates among mid-level managers, and negative for the latter, signalling a decline in their 
share among top-level managers and highly qualified professionals. This suggests that the 
increase in the proportion of higher education graduates (especially long higher education) is 
unmatched by demand of top-level managers and highly qualified professionals. As a result, 
an increasing share of higher education graduates is absorbed by mid-level managers occupa- 
tion. Intensive margins for mid-level managers and top-level managers and highly qualified 
professionals are both negative, for higher education graduates. Because the intensive mar- 
gin is particularly for short higher education graduates working in mid-level management, 
it may be that the ‘absorption’ by mid-level manager of long higher education graduates 
negatively impacts the growth of entry-level wages. This observation is consistent with a 
framework of diminishing marginal returns, where last entrants’ wages decrease because of 


their lower productivity. 
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Figure 1.2: Average wage growth decomposition by education level and occupation 
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The previous analysis can be replicated at a finer level of occupation aggregation available 


in the data. The results of this second level of analysis are presented in Figure 1.3, for 


all higher education graduates and mid-level managers and top-level managers and highly 


qualified professionals only. 


Figure 1.3: Average wage growth decomposition by education level and disaggregated 
occupation - All higher education graduates 
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Figure 1.3 shows a significant heterogeneity in margins within the mid-level managers 
and top-level managers and highly qualified professionals. Sub-occupations can be classified 
into four categories according to their intensive and extensive margins: the first includes ad- 
ministrative staff in the private sector, teachers, and the liberal occupations. It contributes 
significantly to the slowdown in wage growth, is driven by both the intensive and extensive 
margins, although the intensive margin dominates in absolute terms. A second category is 
made up of engineers and health and social workers, whose contribution to the total slowdown 
in wage growth at the time of hiring is no less significant but is broken down differently from 
the first category. Their extensive margin is positive: the proportion of individuals start- 
ing contracts in these occupations increased between the 1998 and 2010 cohorts. However, 
their intensive margin outweighs their extensive margin, and the sum of the two is negative. 
The third category, which differs from the first two in that it has a positive total margin, 
comprises public and private sector managers: the contribution of these occupations to wage 
growth benefits the 2010 cohort compared with the 1998 cohort. This is due solely to a 
positive extensive margin, which is particularly high for private sector managers. Finally, 
the last category includes all other sub-occupation that have a small impact on changes in 


wage growth. 


Except for two occupations (researchers and artists), all occupations suffer from a neg- 
ative intensive margin, i.e. reduced wage growth at hiring, once composition effects (the 
extensive margin) have been accounted for. Besides, occupations in categories two and three 
identified above whose intensive margin is largest in absolute value are also those whose 
extensive margin are most important and positive. This is consistent with a framework of 
diminishing returns: the influx of new employment contracts in these occupations, driven 
by an increase in the supply of higher education graduates, leads to a drop in marginal pro- 
ductivity of individuals who have recently entered the market, which translates into lower 


returns to experience. 


In the next section, I explore two possible mechanisms involved in the intensive margin 
of the wage growth slowdown and how they translate differently into each occupation. 


4 Mechanisms 


I study two mechanisms that are likely to cause heterogeneity within the wage growth 
slowdown intensive margin: the first is promotion to managerial positions: it has been 


documented in the literature (Kwon et al. (2010)) that wage growth is affected if promotions 
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become scarce for a given cohort. Such a mechanism could interact with the heterogeneity 
of the wage slowdown by level of education and occupations if the influx of graduates into 
some occupations, as evidenced by their large extensive margin, is correlated with a decline 
in the share of managerial positions at hiring, which leads to a lower increase in entry 
wages. In a theoretical perspective, the reasons for such a correlation are twofold: first, 
in a framework of decreasing returns an influx of graduates is not followed by demand for 
managers from firms, which mechanically reduces their proportion among new hires. Second, 
if the expansion of higher education graduates on the labour market is accompanied by a 
decline in their unobserved quality, it would lead to a decline in the share of managers within 
cohorts that experience the educational expansion. To check the validity of this mechanism, 
I establish two facts: first, lower access to managerial positions is linked to lower wages on 
hiring, and second, that hiring in those positions does fall for the 2010 cohort compared 
to the 1998 cohort. If it is particularly the case for occupations with the largest extensive 
margins , it suggests that the mechanism is rather based on the theory of decreasing returns, 
since occupations with the largest influx are most affected. If, on the other hand, I observe 
that the decline managerial positions is the same for all occupations, it would indicate that 
a decrease in unobserved quality drives the mechanism. 

The second mechanism I evidence is the degree specialization (or major) and occupation 
match quality. In line with theories of human capital, Liu et al. (2016) show on US data that 
poor match quality in the early years on the labour market weighs on wage developments in 
the medium to long term. Initial matching can be expected to have an impact on a cohort’s 
medium-term wages in two ways: either initial match quality is the same across cohorts 
on average, but its impact on subsequent wages becomes stronger and more persistent, or 
initial match quality decreases across cohorts, and subsequent wage are negatively impacted. 


I show that the results obtained are consistent with the first explanation. 


4.1 Promotion to managerial positions 


To determine whether individuals are hired as managers, I use the question ‘Do you 
manage a team?” in the Generations Surveys. This question provides is more accurate for 
my purpose than using mid-manager or top-manager occupations, as it is unclear whether 
individuals hired to these occupations do in fact manage some of their colleagues. Table 
1.8 shows that the managerial hires increase with the level of education for all cohorts, but 
also that the share is higher for the long higher education graduates among the 1998 than 
the 2010 and 2004 cohorts. Besides, for all cohorts and levels of education, the share of 


managerial hires is higher in the early years on the labour market. Particular attention 
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should therefore be paid to recruitment opportunities in the first few years on the labour 


market. 


Table 1.8: Share of managerial posisitions obtained by cohort, year 1-2 and year 7-8 


Gen 1998 Gen 2004 Gen 2010 
% Manager Year 1-2 Year 7-8 Year 1-2. Year 7-8 Year 1-2 Year 7-8 
Sans diplome 19.2 I3 11.8 22 8.5 124 
Diplome du secondaire 17.8 15.6 12.2 12.2 12.8 13.4 
Diplome du tertiaire court 19.8 18 16.4 14.1 18 LQ 
Diplome du tertiaire long 34.6 33.6 28.8 28.1 26 22.9 


To understand the link between managerial positions and wage levels in the medium 


term, I estimate the following regression is made by cohort, only at years 7 and 8: 


log wijt = Tinea Messe x My Gi Yj Sj Exjt, (1.6) 
e Pp 

where Mj, is a binary variable equal to 1 if the new job is a manager position, and 0 
otherwise. The estimator ¢,ep indicates the average wage gain of a manager position com- 
pared to a non-manager position in the medium term, by cohort, level of education, and 
occupation. Long and short higher education graduates are grouped into the same category 


of higher education graduates. 


Table 1.9 presents the three regressions (1.6) for the 1998, 2004 and 2010 cohorts. The 
impact of a managerial position on entry wages varies according to level of education: sig- 
nificant coefficients are all positive for high school and higher education graduates (except 
employees for 2004 high school graduates). On the other hand, the relationship is negative 
and significant for individuals with no degree working as employees and factory workers, in- 
dicating that a managerial position at this level of education does not offer the same benefits 
as to other levels. The relation between managerial position and entry wages is particularly 
strong for higher education graduates working as mid-level managers, and top managers and 
highly qualified professionals. However, the intensity of the relationship decreases between 
the 1998 and 2010 cohorts: among top managers and highly qualified professionals, a man- 
agerial position is associated with a salary 63% higher for the 1998 cohort and only 51% 
higher for the 2010 cohort. This decline could be the result of a drop in managerial produc- 


tivity due to the particularly large influx of graduates into these occupations. However, the 
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relationship remains significant and suggests examining the evolution of access to managerial 
positions between the 1998 and the 2010 cohorts. 


Table 1.9: Log entry wage regressed on dummy for managerial position by education level 
and occupation, with gender, location and sector fixed effects 


log entry wage 
Gen 1998 Gen 2004 Gen 2010 


0.226 
Manager x No degreex C/S/BO (0.178) ( ( 
Manager x No degreex 0.23 0.231 
TM/HQP (0.303) () (0.267) 
Manager x No degree x -0.199** 0.01 
Employees (0.086) () (0.084) 
Manager x No degreex Factory -0.103* 0.094 -0.143*** 
workers (0.055) (0.177) (0.053) 
Manager x No degree x 0.067 0.574** 0.111 
Mid-level managers (0.149) (0.228) (0.083) 

-0.043 


Manager x HS deg.x C/S/BO (0.078) () () 


0.554 0.374" 0.374" 
(0.086) (0.086) (0.078) 
-0.048  -0.105** =~ -0.052 
(0.04) (0.051) — (0.065) 


Manager x HS deg.x TM/HQP 


Manager x HS deg. x Employees 


Manager x HS deg.x Factory 0.045 0.073* 0.023 
workers (0.032) (0.038) (0.054) 
Manager x HS deg. x Mid-level 0.199*** 0.155*** O.18i""* 
managers (0.035) (0.034) (0.041) 
0.575" 0.568 


M HE. deg.x C/S/BO 
anager X eg.x C/S/ (0.12) () (0.378) 


0.634*** — 0.622*** ——-0..508*** 


M x HE. deg.x TM/HQP 
anager eg /HQ (0.028) (0.038) (0.037) 


Manager x HE. deg. x 0.101 O21™ 0.148 

Employees (0.107) (0.103) (0.117) 
Manager x HE. deg.x Factory 0.013 0222" 0.073 

workers (0.118) (0.114) (0.114) 
Manager x HE. deg.x Mid-level 0.413*** 0.248*"* 0.354*** 
managers 
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(0.043) (0.043) — (0.048) 


FE gender, location, industry v v v 
Observations 4 730 3 433 2 792 
R? 0.332 0.277 0.291 


*p<0.1; **p<0.05; ***p<0.01 

Cells are empty where there were too few observations 
Observations are the sequences between years 7 and 8 
HS: High School, HE: Higher educ. 

C/S/BO: Craftmen, Shopkeepers, Business owners 
TM/HQP: Top managers, Highly qualified prof. 


Probability to access a manager position by cohort and level of education within each 


occupation is assessed using the following logistic regression: 


Myt = Se Tieaeea Y, Madeeetee X a +e; 4 0; Gi YT; Sj Eijt, (1.7) 
e p 


where M,; is the binary variable equal to 1 if the individual is hired as a manager and 
0 otherwise. Years since leaving the education system a; range from 1 to 6 inclusive. The 
coefficient Pyep = 100 x (exp(igep) — 1) informs on the probability increase each year in per- 


centage terms. 


Table 1.10 shows the change in Pgep between 1998 and 2010, by level of education. 
Among 1998 higher education graduates hired a top managers or highly qualified profes- 
sional, chances of accessing a managerial position increase by about 1.2% per year. This 
percentage is negative and non-significant for the 2010 cohort. For middle-managers the 
increase in probability is non significant for the 1998 cohort, and negative and significant for 
the 2010 cohort (-1.1%). 


Table 1.10: Chances of obtaining a managerial postition by years spent on the labor market, 
by education level and occupation 


Managing position 
Gen 1998 Gen 2004 Gen 2010 


3.968" 


Year x No degree x C/S/BO (0.02) () () 
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17.697*** —_-3.971 -5,156 
(0.026) (0.058) (0.042) 
-1.464"" = -1.212* 0.921" 
(0.006) (0.007) — (0.005) 


Year x No degree x TM/HQP 


Year x No degree x Employees 


Year x No degree x Factory -0.876* -1.14* 0.519 
workers (0.005) (0.006) (0.005) 
Year x No degree x Mid-level 0.751 -4,.375*°** 1.142 
managers (0.009) (0.009) (0.008) 
0.06 3.692 0.521 
Year x HS deg. x C/S/BO 
(0.014) (0.074) (0.047) 
3.281°* -1.185 -1.555* 
Year x HS deg. x TM/HQP 


(0.007) (0.008) (0.009) 
0.641% -0.012—--1.054*** 


Year x HS deg. x Employees 
a (0.003) (0.003) (0.003) 


Year x HS deg. x Factory -0.305 0.309 -0.207 
workers (0.003) (0.003) (0.004) 
Year x HS deg. x Mid-level -0.126 -0.515 -0.571 
managers (0.004) (0.004) (0.004) 
2.43* 13.451** 2.604 
Year x HE. deg. x C/S/BO 
(0.014) (0.05) (0.065) 
1.208*** aay dad -0.492 
Year x HE. deg. x TM/HQP 


(0.004) (0.004) (0.004) 
-1.934*** 0.613. —--0.995* 


Year x HE. deg. x Empl 
ear eg mployees (0.004) (0.004) (0.005) 


Year x HE. deg. x Factory -0.524 0.827 0.43 
workers (0.007) (0.006) (0.007) 
Year x HE. deg. x Mid-level 0.123 0.642* aLASe 
managers (0.003) (0.003) (0.004) 
FE education, occupation v v v 
FE gender, location, industry v v v 
Observations 32 700 23 226 17 301 
R? 0.059 0.071 0.066 


*p<0.1; **p<0.05; ***p<0.01 
Cells are empty where there were too few observations 


Observations are the sequences between years 1 and 6 
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HS: High School, HE: higher educ. 
TM/HQP: Top managers, Highly qualified prof. 
C/S/BO: Craftmen, Shopkeepers, Business owners 


4.2 Major-Occupation match quality 


Match quality between degree and employment plays an important role in the persistence 
of initial economic conditions’ effect on medium term wage (Liu et al. (2016)). By defining 
match quality by wage level in first job for each degree specialization in each industry, the 
authors establish that the deterioration of initial match quality match due to poor economic 
conditions in the United States in the 2010s (including high unemployment) has a downward 
impact on wage levels in subsequent jobs. This mechanism is particularly salient for higher 
education graduates, who are more specialized than their less educated peers. In France, in 
addition to the recession of the early 2010s, there has been an increase and diversification 
of the educational supply (Dupray and Moullet (2010)) among the 2004 and 2010 cohorts, 


which may also have affected the quality of initial matching the first job for these two cohorts. 


Table 1.11 shows the list of degree majors available to higher education students in 
France and their distribution among each cohort. Despite the diversification of educational 
provision, their distribution remains fairly stable between the 1998 and 2010 cohorts. There 
is however a decline in the ‘Mechanics, Electricity’ and ‘Electronics’ specialization and the 


rise of the ‘Mathematics and Science’ and ‘Social work’ specializations. 
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Table 1.11: Degree specialization shares by cohort 


Share of graduates (%) | Gen 1998 Gen 2004 Gen 2010 
Agriculture, fishing & woodland 4.4 4.4 3.8 
Civil engineering & Construction 4.7 5.1 4.3 
Communication & information i 6.8 4.8 
Community services 2.5 LS i 
Flexible materials 1 0.9 0.5 
General production 3.6 4.8 6 
General service 0.2 0.5 3.3 
General training 4.4 0.5 
Humanities & law 10.8 8.3 9 
Industrial transformations 5.6 5.9 4.9 
Literature & arts 4.7 4.7 5.1 
Mathematics & sciences ot 4.9 9.1 
Mechanics, electricty & electronics TS 12.1 7.9 
Personal services 15.7 18.9 19.7 
Trade & management 17.9 21.3 19 
Total 100 100 100 


Thanks to the detailed level of data, I can analyse the specialization-occupation match 
quality. It is defined by the following regression, performed only on contracts starting in the 


year in which each cohort enters the labour market, by cohort and education level: 


log Wijt = S- L fepec;=s] ye 1 [oce; =p) Ogeps a €ijts (1.8) 


Pp 
where spec;is the specialty chosen by individual 7 during their studies. The estimated 
coefficient dgeps is an average of the logarithm of the first year’s wage on the labour market, 
by cohort, education level, occupation, and degree specilization. To define a measure of 
matching, I look for the best matched specialization within a cohort, education level and 
occupation, i.e. the one for which average wage is highest: 


Ojep = max Osepe (1.9) 


Matching quality for a given degree specialization is defined by how far it stands with 


respect to the best matched specialization: 
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Deeps = — |Sgeps — Sgeps| « (1.10) 


geps 


Dayeps is always below or equal to zero, if s is the best matched specialization. The differ- 
ence between dgeps and dgeps for two specialisations s and s’ is interpreted as the percentage 
difference between the average hiring wage for s and for s’. The higher the absolute value of 
Dayeps, the farther average salary for the specialization s is from best matched specialization. 
Matching quality is then be said to be poor. Dgeps is a flexible measure of matching since 
one specialization may be mismatched with one occupation, but well matched for another. 
Matching quality is computed at cohort and education level, so that average earnings com- 


parison between cohorts are irrelevant to computing Dgeps. 


Individuals who are not hired in their first year on the labour market are not included 
in regression (1.8). These individuals are excluded from the analysis, which therefore covers 
only a subset of each cohort. Another definition for the period during which initial matching 


quality is computed is explored in the robustness tests. 


Table 1.12 shows the evolution of measure Dgeps by cohort and education level in terms 
of median and interquartile deviation, weighted by individuals. Since individuals with no 
degree do not choose a specialization, they are excluded from the present analysis. Although 
median quality of matching deteriorates for high school graduates, it increases for higher 
education graduates, while the interquartile gap is the same for higher education graduates 
between 1998 and 2010 cohorts (and narrower for the 2004 cohort). Worsening of matching 
quality does not therefore appear to be a factor in wage growth slowdown for higher education 
graduates. It remains to be determined whether, despite the consistent quality of matching 
across the three cohorts, its effect on the persistence of wage levels changed between the 
1998 and 2010 cohort. 


Table 1.12: Match quality: median and interquartile range by cohort and education level 


Education level | Gen 1998 Gen 2004 Gen 2010 
p50 eGo -0.16 -0.22 
High school degree 
p25-p75 [-0.33,-0.09] [-0.26,-0.09]  [-0.38,-0.12 
Short higher educ. p50 -0.23 -0.16 -0.21 
degree p25-p75 [-0.32,-0.1]  [-0.31,-0.05] — [-0.31,-0.09 
Long higher educ. po0 -0.38 -0.13 -0.23 
degree p25-p75 [-0.46,-0.16] [-0.26,-0.07]  [-0.32,-0.13 
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The impact of initial match quality on entry wages in subsequent years is assessed by the 


following regression, at cohort and education level: 


log wit = S  Wipedi aD dene Nee + Ay 4 Gi Tj Sj Ej jt- (1.11) 

Unlike the previous regressions, the aim is to obtain an estimated coefficient Nees dif- 
ferentiated per year. Within a cohort and education level, all individuals face the same 
conditions every year, captured by a fixed effect, hence the only variation between individ- 


uals in this regression is due to the difference in initial matching quality. 


The estimated coefficients Neces are presented in Table 1.13 for long higher education 
graduates only. Distinguishing between short and long higher education graduates matters 
in this analysis because degree specialization is closely linked to education level. Table 1.13 
shows significant intergenerational differences among long higher education graduates: in 
the first years on the labour market, initial match quality’s effect on entry wages is similar 


for all cohort. In year 1, a 1 percentage point increase in match quality, i.e. a .01 relative 


* 
geps? 


83% and .67% respectively). However the effect of initial match quality diverge between 


increase of Ora over 6 results in a wage increase of almost 1% for all generations (.72%, 


cohorts around year 4, since they are no longer significant for the 1998 cohort, whereas they 


persist until year 7 for the 2004 cohort and until year 8 for the 2010 cohort. 
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Table 1.13: Log entry wage regressed on match quality by year and education level - Long 
higher education graduates 


log entry wage 


Gen 1998 Gen 2004 Gen 2010 
Year: taste quality Of 1sr* 0.828*** 0.665*** 
(0.057) (0.052) (0.034) 
Year Sian analy 0.264* 0.223" 0.342*** 
(0.099) (0.092) (0.058) 
Year 3 x match quality iui ee “0.012 
(0.126) (0.096) (0.066) 
Year 4 x match quality 0.012 0.198** 279" 
(0.136) (0.093) (0.074) 
Year 5 x match quality ae Coa 0.363" 
(0.108) (0.122) (0.084) 
Year 6 <x match quality oer 0.343™ 0.087 
(0.128) (0.166) (0.093) 
Year 7 X match quality see 0.415" 0.297°" 
(0.159) (0.119) (0.086) 
Year 8 x match quality D209 “0.123 0.195™ 
(0.166) (0.161) (0.079) 
FE gender, location, industry i Gf V 
Observations 1 634 2 550 4 298 
R® 0.43 0.325 0.291 


*p<0.1; **p<0.05; ***p<0.01 


Only individuals who found a job in the first year after leaving school are included 


Initial match quality plays a significant role in slowing down the salary progression of long 
higher education graduates between the 1998 and 2010 cohorts, not because the quality has 
deteriorated, but because its impact on subsequent salary levels has increased. This could 
be explained by mobility: if the 2010 cohort would change jobs less frequently than the 1998 
cohort, the initial match quality may play a role in determining hiring wages for a longer time. 


However, average number of employment spells for long higher education graduates is 4.1, 
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4.5 and 4.4 for cohorts 1998, 2004 and 2010 respectively. The two last cohorts are changing 
jobs as much, if not more, than the 1998 cohort. To explore the mobility hypothesis in more 
detail, it is necessary to analyse the structure of transitions between occupations of each 
generation. If the 2004 and 2010 cohorts switch occupations less often than the 1998 cohort, 
initial matching quality’s impact may increase as individuals remain stuck in occupations 
to which their degree is not adapted. However, this hypothesis is again contradicted by the 
data: among the long tertiary graduates, those who never change occupation in the first 
seven years on the labour market account for 40%, 34% and 32% of the 1998, 2004 and 2010 
cohorts, respectively. Individuals changing only once accounted for 31%, 29% and 30% of 
each cohort. This shows changes in occupations are more common among the 2004 and 2010 
cohorts. Finally, the increased impact of initial match quality could be explained by the rise 
in the number of graduates in the labour market: mismatched individuals in the 2004 and 
2010 cohorts may find it harder to access higher-paying jobs for their degree specialization 
as each year a new cohort enters the labour market, increasing competition for the best jobs. 
Because the supply of long higher education graduates is smaller in the late 1990s and early 
2000s, the 1998 generation faces less competition and is able to make up for any low initial 


match quality. 


5 Robustness tests 


5.1 Sample representativity 


The Generation surveys only provide information on wages when individuals transition 
from a job to another, or when they transition from and to unemployment. As such, they 
constitute an unbalanced panel: some individuals are not observed in some years. An indi- 
vidual that transitions often makes up more observations than an individual who stays in 
the same spell over the period, and thus have a greater weight in the data. It is therefore 
important to check that individuals that go through few transitions, such as those hired in 
their first year on the labour market who remain in their jobs for the next seven years, ex- 
perience the same trend in wage growth between the generations 1998 and 2010. To do this, 
I perform two analyses: the first uses the wages observed during the last interview session 
of each generation. If the interviewee is employed during the last session, his or her current 
salary is reported as exit wage, even if his or her employment spell is not ending. This 
provides a cross-section of the entire population surveyed at the end of 2005, 2011 and 2017 
respectively for each of the 1998, 2004 and 2010 cohorts. Comparing wages in this cross- 


section by education level and across cohort is a way of checking all graduates are affected 
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by declining returns to experience. The second analysis looks at annual average exit wages 
over the seven years and examines whether entry wage growth slowdown is compensated for 


by pay raise during employment spells. 


Table 1.14 shows the average observed wages in the 2005, 2011 and 2017 cross-sections (in 
constant euro base 2017) by education level and cohort. These cross-sectional wages have 
decreased on average between the 1998 and 2010 cohort for higher education graduates, 
which confirms that wage growth has slowed for all individuals in the cohort, including those 


in long-term employment. 


Table 1.14: Average observed wage at end of survey, by cohort and education level 


Niveau Education Gen 1998 Gen 2004 Gen 2010 


Sans diplome 1325 1341 1356 
Diplome du secondaire 1499 1472 1508 
Diplome du tertiaire court 1918 1775 1826 
Diplome du tertiaire long 2902 2594 2567 


Finally, Figure 1.4 shows changes in average exit wages (i.e. the last wage received in the 
job) over time by education level and cohorts. These wages are higher than hiring wages for 
all cohorts and education levels but exhibit the same slowdown trend for the higher education 
graduates between 1998 and 2010 as entry wages. I conclude that pay raises on the job do 


not compensate for decreasing returns to experience at hiring. 
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Figure 1.4: Average real wage at employment spell’s end, by cohort and education level 
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5.2 Unobserved heterogeneity 


The results presented above are based on the identification assumption that the distri- 
bution of unobserved quality is there within each cohort. I test this assumption by using a 
proxy for unobserved quality, which is grade repetition before the start of secondary school. 
The Generation surveys provide individuals’ age in 6th grade (the first grade in secondary 
school). Normal age is 6th grade is 11 years old, hence if an individual is older when enter- 
ing 6th grade, I deduce they have repeated a grade in primary school. Repeating a grade 
before 6th grade indicates lower academic and learning abilities, which in turn affects the 
individual’s wage levels in the labour market. I am agnostic as to the causes of these lower 
abilities. 23.0% of individuals in the 1998 cohort, 12.1% of individuals in the 2004 cohort 
and 12.8% of individuals in the 2010 cohort repeated a grade in primary school. The practise 
of grade repeating scaled back over the period, hence the high number of individuals who 
repeat grade in the 1998 generation is only partly indicative of a lower average unobserved 


quality. 


To check the impact of unobserved quality on wage levels I introduce a dummy for class 


repetition in my baseline regression: 


log wijt = > W educpae) % Ge + OG + Oe +17 + Sy + Cay2, (1.12) 
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where a; is equal to 1 if the individual has repeated a grade before entering secondary 
school and 0 otherwise. Since class repetition is only an imperfect measure of unobserved 
quality, and its practise has evolved between the 1998 and 2010 cohort, I do not compare its 
effect on log wages between cohorts. Instead, Bes will be useful to understand if the effect 


observed in the baseline analysis is solely due to variation in unobserved quality. 


Regression estimation is presented in Table 1.15. Grade repetition has a significant 
and negative effect on wage for all cohorts. It does not significantly change the previous 
results however: the slowdown in wage growth for long and short higher education graduates 
between 1998 and 2010 remains qualitatively the same. This suggests that this slowdown is 


not due to unobserved quality variations between the 1998 and 2010 cohorts. 


Table 1.15: Log entry wage regressed on dummy for grade repeat and years spent on the 
labor market by education level 


log entry wage 
Gen 1998 Gen 2004 Gen 2010 


-0.06"** — -0.046"*" ~—-0.058*** 
(0.004) (0.006) (0.007) 
-0.01%* — -0.009"* = -0.007"** 
(0.001) (0.002) (0.002) 
gor o00r aol 
(0.001) (0.001) (0.001) 


Grade repeat 


Years x Sans diplome 


Years x High school degree 


Years x Short higher educ. 0.054*** 0.038*** 0.04*** 
degree (0.001) (0.001) (0.002) 
Years x Long higher educ. aN ae 0.099*** 0.084*** 
degree (0.002) (0.002) (0.002) 
FE gender, location, industry v v v 
Observations 37 785 27 599 19 992 
Re 0.29 0.218 0.229 


*p<0.1; **p<0.05; ***p<0.01 


Only individuals whose age is known in 6th grade are included 


A second measure of unobserved quality, specific to long higher education graduates, is 


e 


the type of school in which individuals obtained their degree, and whether they obtained a 
master’s degree or PhD. There are distinction in France between public university, where 
students can graduate both from a master and a PhD, and “Grandes Ecoles” , specific schools 
specialized in engineering or business, that deliver specific degrees. Engineering and busi- 
ness schools graduates often obtain higher wages than university graduates. Since business 
and engineering schools have traditionally been more selective than universities, it can be 
expected that they will continue to be selective even if more young people apply. In uni- 
versities, on the other hand, there is no selection at entry since any student can enrol for 
a bachelor’s degree (except in a few courses where seats are scarce). It may then be that 
the 1998 cohort count proportionally more engineering and business school graduates, driv- 
ing average wages upwards. Table 1.16 shows the distribution of types of degrees obtained 
among long higher education graduates by cohorts. PhD are not accounted for among the 
1998 cohort and the type of degree is not unknown for 1.5% of graduates in the 2004 cohort. 
The share of university graduates (master’s and doctoral degrees) indeed increased between 
cohorts 1998 and 2010. 


Table 1.16: Degree type shares among long higher education graduates 


Degree type (%) | Gen 1998 Gen 2004 Gen 2010 
Business degree 10.1 11.1 9.2 
Engineering degree 25.9 23.4 19.9 
Masters degree 64.4 Af 58.8 
Doctorat 16.6 Hea | 
Inconnu 1.5 


To check that the wage growth slowdown is not due to a composition effect on the type 
of schools between the 1998 and 2010 cohorts, I perform the following regression, only for 
graduates of the long higher education sector: 


log wei, = J Uaegrees=aEqa X O% + +éize; (1.13) 


d 

where log w?, is log wage cleaned of fixed effects for gender, region and industry that was 
computed in section 2 equation (1.2). Estimate End capture wage growth by degree type. It 
is presented in Table 1.17. For Individuals who graduated from business and engineering 
schools, where the selection should have remained stronger than at university, wage growth 
slowed as much as at university (-42, -33, -33 percentage points for business, engineering, 


and master’s graduates, respectively, between generations 1998 and 2010). The type of 
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degree does not capture unobserved quality that could be driving the wage growth slowdown 


observed between cohorts. 


Table 1.17: Log entry wage regressed on years spent on the labor market by type of degree 


log entry wage 
Gen 1998 Gen 2004 Gen 2010 


0.081*** 0.073" ~——0.038*** 
(0.005) (0.004) (0.004) 
0.065* 0.057" 0.042" 
(0.003) (0.004) (0.003) 
Gon ae’ coll 
(0.002) (0.002) (0.002) 

0.071* 0.063" 


Years < Business degree 


Years <x Engineering degree 


Years x Masters degree 


Years x Doctorat 


() (0.004) (0.004) 
FE gender, location, industry v v v 
Observations 2 787 3 835 5 891 
R? 0.105 0.091 0.053 


*p<0.1; **p<0.05; ***p<0.01 
Individuals whose degree type is unknown are excluded 


PhDs are not accounted for in Gen 1998 


5.3 Matching quality 


Finally, I test for a different period over which initial matching quality is computed. 
In the baseline, the period considered is the first year on the labour market, i.e. the first 
observation year of the entire cohort (i.e. 1998, 2004 and 2010 respectively). However, this 
definition is restrictive in that it does not always leave a full year to everyone. For example, 
if an individual graduates in June, it leaves only 6 months, between June and December, 
to observe a first hire. The benefit of the baseline definition is to ensure that individuals 
face the same conditions in the labour market over a limited period, but it may neglect first 
hires for individuals finding their first job early in the year after the entire cohort leaves the 
education system. The alternative period definition considers the first year on the labour 


market at individual rather than cohort level: for instance, if an individual graduates in 
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June 2010, their first year on the labour market runs from July 2010 to June 2011. I then 


carry out the same analysis to study the impact of initial matching quality on wage growth. 


The results of the analysis using modified matching quality are presented in Table ??. 
These results are qualitatively similar to the baseline: the 2010 cohort experiences a longer 
impact of initial matching quality on wage levels than the 1998 generation. However, the 
effect is shorter for cohorts 2004 and 2010, since it is no longer significant from years 4 and 
6 (compared to 7 and 8 in the baseline analysis). R? are higher in the reference regression 
for the 1998 and 2004 cohorts, and almost equal for the 2010 cohort despite the increase in 
the number of observations between the reference regression and this one. Hence the shorter 
effect may be due to a loss in precision due to the new definition of initial period, that 


doesn’t hold initial job market conditions constant. 
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Table 1.18: Log entry wage regressed on alternative match quality by year and education 
level - Long higher education graduates 


log entry wage 


Gen 1998 Gen 2004 Gen 2010 
et i lit 0.944*** 0.864*** G,62""* 
ear 1 x match quali 
oe (0.048) (0.044) (0.025) 
; 0.529*** 0.486*** 0.315** 
Year 2 x match quality 
(0.061) (0.058) (0.033) 
; 0.488*** 0.501*** 0.222*** 
Year 3 x match quality 
(0.071) (0.074) (0.041) 
ee i lit 0.09 -0.003 0.236*** 
ear 4 <x match quali 
oo (0.092) (0.07) (0.046) 
: -0.1 0.041 0.241*** 
Year 5 x match quality 
(0.093) (0.086) (0.056) 
Year 6 ne lit -0.029 0.042 0.089 
ear 6 X match quali 
eae (0.099) (0.11) (0.059) 
; 0.101 -0.249*** -0.022 
Year 7 x match quality 
(0.102) (0.096) (0.055) 
-0.019 -0.538°** 0.018 
Year 8 x match quality 
(0.109) (0.111) (0.054) 
FE gender, location, industry v v v 
Observations 5 559 4 721 5 116 
R? 0.188 0.202 0.3 


*p<0.1; **p<0.05; ***p<0.01 


Only individuals who found a job in the first year after leaving school are included 


6 Conclusion 


The Generations Surveys highlight a delay in wage growth that affect cohorts leaving 
the higher education system in 2004 and 2010 compared to the cohort who graduates in 
1998. I decompose the wage growth slowdown by occupation in two margins: an extensive 


margin, which reflects changes in the distribution of occupations within each cohort, and an 
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intensive margin, that captures changes in hiring wages by occupation between cohorts. I 
perform the decomposition at a finer level of occupation and find a clear heterogeneity in 
the extensive margin between occupations in middle management, and top management and 
highly qualified professionals. Among these two categories, occupations which experience 
the largest influx of higher education graduates between the 1998 and 2010 cohorts are also 
those for which the intensive margin is largest. It suggests the influx of young graduates 
has not increased the productivity of companies as much as their senior counterparts, which 
has impacted their wage growth downward. I further explore possible mechanisms in line 
with this interpretation: access to manager positions and initial match quality. I find both 


mechanisms play a role in the young higher education graduates’ wage growth slowdown. 
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CHAPTER 2 


Education Expansion, Sorting, and the Decreasing 


Education Wage Premium 
Abstract 


This chapter studies the interplay between worker supply and firm demand, and their 
effect on sorting and wages in the labor market. I build a model of one-to-many 
matching with multidimensional types in which several workers are employed by a sin- 
gle firm. Matching is dictated by worker preferences, their relative productivity in the 
firm, and substitution patterns with other workers. Using tools from the optimal trans- 
port literature, I solve the model and structurally estimate it on Portuguese matched 
employer-employee data. The Portuguese labor market is characterized by an increase 
in the relative supply of high school graduates, an increasingly unbalanced distribution 
of high school graduates versus non-graduates across industries, and a decreasing high 
school wage premium between 1987 and 2017. Counterfactual exercises suggest that 
both changes in worker preferences and the increasing relative productivity of high 
school graduates over non-graduates act as a mitigating force on the decreasing high 
school wage premium, but do not fully compensate for high school graduates’ rise in 


relative supply. 
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1 Introduction 


Between the 1970s and today, many economies both in the developed and developing 
world have experienced an increase in their educated labor supply. As a result, the ratio of 
educated workers (whether high school or college-educated) to uneducated workers present 
in labor markets has risen. The shift in labor supply’s education level has induced broad 
changes in labor markets, both in terms of workers’ allocation to firms and wage structure. 
Workers-firm allocation, or matching, refers in this chapter to how workers of different ed- 
ucation and experience level sort with firms in different sectors. It generates dispersion in 
the wage structure through its impact on the output produced by a firm and its workforce. 
The output is determined both by the worker-firm match and the match between a worker 
and his co-workers. In particular, workers-firm allocation has repercussions on the wage 
returns to education. This paper seeks to provide a theoretical framework to understand 
how changes in labor supply affects matching between workers and firms, and through this 
channel, impacts returns to education. It proposes a novel model of matching on the labor 
market in which a single firm matches with several workers. The model is structurally esti- 
mated on Portuguese matched employer-employee data. In doing so, I am able to quantify 


the impact of supply and demand changes on worker-firm allocation and wage structure. 


The mechanisms driving matching between workers and firms and the resulting wage 
distribution are two-sided. On the one hand employed workers with various education and 
experience interact within the firm to produce an output, whose level depends on a produc- 
tion function that is particular to each firm’s sector. Given their production function, firms 
seek to hire a workforce, which is a mix of workers with different characteristics, to maximize 
their profit. On the other hand, workers have preferences for the tasks performed on the job, 
which vary from one sector to another. Worker preferences impact which sector they are 
willing to work in. Given distributions of education and experience in the worker population 
and sectoral composition among firms, firm production requirements and worker preferences 
result in a given level of sorting and wage gaps. Sorting is the result of worker-firm allocation: 
it is the ratio of educated to non-educated workers in each sector. Wage gaps summarize the 
wage distribution: they are the ratio of educated workers’ to non-educated workers’ average 


wage. 
To capture these mechanisms, I build a static one-to-many matching model with trans- 


ferable utility. Workers and firms differ with respect to their observed characteristics, which 


are summarized by a multidimensional type, as well as a stochastic shock that accounts 
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for unobserved heterogeneity. A single firm matches with several workers, who constitute a 
bundle that forms its workforce. The surplus created by the match depends on the firms’ 
observable characteristics as well as the workforce. The utility is transferable under the form 
of wages paid by the firm to the workers in its workforce. Firms seek to maximize total 
profit, which is additive in the difference of production and total wage bill, plus random 
shocks. Workers maximize their utility, which is additive in amenities, wage, and a random 
shock. Amenities embody workers’ inner preference for a given type of firm. At equilibrium, 
wages clear the market and each agent matches with their best option given wages. The 
model can generate a rich distribution of wages that depend both on workers’ and firms’ 
observable characteristics, as well as on the employed workforce. It also predicts equilibrium 
matching, which is the joint distribution of firms and workforces. Using both matching and 


wages, I can separately identify firm production from workers’ amenities. 


The framework offers more flexibility in estimation than classic supply and demand mod- 
els developed in Katz and Murphy (1992) and Card and Lemieux (2001): it identifies worker 
preferences in addition to firm production, as well as varying production parameters over 
time, which allows for non-linearities in the evolution of firm production paramaters. This 
is because by explicitly modeling firms’ and workers’ match choices, I can use both observed 
matching and observed wages, which brings more power to identification. The model is fit- 
ted to the data by assuming parametric forms for firm production and workers’ amenities. 
I classify workers into two education levels, high school graduates and non-graduates, and 
three age groups, young, middle-aged, and senior. Firms are differentiated by their sector 
of activity. Following the literature, I choose a nested Constant Elasticity of Substitution 
(CES) function for production, with productivity parameters for each education level that 
vary between sectors. I assume worker preferences for firms depend on a worker’s age, ed- 
ucation level, and firm sector. Equipped with model predictions for matching and wages, I 
structurally estimate the model on matched employer-employee data. I estimate the model 
by maximum likelihood on the joint distribution of matching and wages, separately every 


three years. 


The model developed in this paper is related both to one-to-many assignment problems 
studied in mechanism design (Bikhchandani and Ostroy (2002), Vohra (2011)), and to one- 
to-one matching models used in family economics (Choo and Siow (2006)). This paper 
bridges the gap between these two literatures: it extends one-sided assignments to two-sided 
matching, and generalizes one-to-one matching to one-to-many. Additionally, I extend the 


econometric framework of Choo and Siow (2006) and Galichon and Salanié (2021) to one- 
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to-many matching. 


I use the novel theoretical framework developed to study the Portuguese labor market 
between 1987 and 2017. I highlight three facts on the Portuguese labor market: first, the 
country operates a vast education expansion over the period, which translates in a dramatic 
increase in the relative supply of high school graduates to non-graduates on the labor mar- 
ket. Second, the high school wage premium decreases over the period. The high school 
wage premium is defined as the wage gap between workers who graduated from high school, 
and those who did not. The decrease in wage premium is particularly stark among young 
workers. Third, I measure worker-firm sorting, which is defined as the relative number of 
high school graduates over non-graduates in an age group employed in a given sector. The 
distribution of high school graduates versus non-graduates across industry sectors becomes 
highly unbalanced, in favor of services, and transports and communications, who employ 
an increasing share of high school graduates. The former two facts imply relative supply of 
high school graduates over non-graduates has grown faster than firms’ relative demand for 
high school graduates over non-graduates. The latter suggests that sorting between workers 
and firms has evolved over the period: either because firms in services and transport and 
communications demand an increasing share of high school graduates, or because high school 


graduates’ preference for these firms strengthens. 


Portugal is a particularly relevant example of rapid supply and demand changes on the 
labor market: it entered the European Union in 1986, which fuelled its economy’s transition 
from being dominated by manufacturing (50% of the labor force employed in 1987), to ser- 
vices (30% of the labor force employed in 2017). Meanwhile, only 10% of its employed labor 
force held a high school degree in 1987, a percentage that has risen to 50% in 2017. As a 
point of comparison, the percentage of high school graduates in the US workforce has gone 
from 75% to 90% over the same period!. The proportional increase of high school graduates 
in Portugal is more extensive and starts from a much lower share of high school graduates 
on the labor market than in the US. In this respect, it is closer to the change in university 
graduates on the US labor market (from 20% to 35% over the same period). Graduating 
from high school has become much more common in Portugal over the last thirty years, 
but it is only in 2007 that high school graduates start representing the majority of young 
workers between 25 and 30. In 2017, 32% of the young workers between 25 and 30 still do 
not hold a high school degree. Meanwhile, university graduates in Portugal represented less 
than 3% of the employed labor force in 1987 and about 19% in 2017. Because the share of 


'Percentages computed over workers aged more than 25, Census data 
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university graduates remains small for most of the period (it only reaches 10% in 2005), and 
because graduating from high school is still quite uncommon over most of the period I study, 
I consider a high school degree to be a differentiating signal in skill on the Portuguese labor 


market, much as a college degree is on the US labor market. 


I find that relative demand for high school graduates from firms in the Services, Man- 
ufacturing, and Transport & Communications sectors has increased dramatically over the 
period, starting in the early 2010s. This finding is in line with the skill-biased technological 
change hypothesis. I also find that young and middle-aged high school graduates’ prefer- 
ence for these industries has declined over time, while their share in production increases 
compared to senior workers. Compared to the classic supply and demand framework, these 
observations offer two additional mechanisms whereby high school wages gaps stay positive 
when a large number of high school-educated workers enter the labor market. First, a de- 
crease in workers’ amenities pressures wages upwards. Second, variation in young graduates’ 
share in production compared to more senior high school graduates increases firm demands 
for the former compared to the latter. I perform several counterfactual exercises to assess 
the separate actions of changes in workers’ demographics (both in education and age distri- 
bution), firm sector composition, firm demand through production parameters, and worker 
preferences, on sorting and wage premium. I find that changes in demographics are the 
main positive drivers of changes in sorting. Changes in industry composition, firm demand, 
and worker preferences overall have a negative, but modest, effect on sorting. Wage premia 
by age group and industry are negatively affected by changes in worker demography and 
industry composition and positively affected by changes in firms’ demand. These suggest 
changes in relative productivity in favor of high school graduates have driven the high school 
wage premium up, but cannot compensate for the large increase in the relative supply of 


graduates versus non-graduates. 


Related literature. The theoretical tools developed in this paper belong to the matching 
literature started by Becker (1973). My model is a one-to-many extension to the seminal 
work of Choo and Siow (2006) in the one-to-one case. As in Dupuy and Galichon (2022) 
and Galichon and Salanié (2021), it explicitly borrows tools from the optimal transport 
literature to introduce unobserved heterogeneity in the form of random utility and relies on 
Gretsky et al. (1992) to show equilibrium existence. This paper is also close to the hedonic 
model literature (Ekeland et al. (2004), Heckman et al. (2010)). A discussion of the links 
between hedonic models, matching with transferable utility, and optimal transport can be 


found in Chiappori et al. (2010). My work is also related to the seminal paper by Kelso 
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and Crawford (1982), and more recent work by Che et al. (2019) on one-to-many matching 
with non-transferable utility and Azevedo and Hatfield (2018) on one-to-many matching 
with transferable utility. They both show the existence of equilibrium for a large class of 
firm preferences, under a large market assumption, which I also use in this paper. I take 
one-to-many matching models a step further by taking the model described in this chapter 
to the data by introducing random shocks that account for unobservables and estimating it. 
The mechanism design literature has also explored many-to-one assignment problems in a 
one-sided framework with work by Bikhchandani and Ostroy (2002) and Vohra (2011). 

This framework differs from the Sattinger model (Sattinger (1979), Sattinger (1993)) 
that assumes no unobserved heterogeneity and rests on the the firm’s production function’s 
supermodularity to find the optimal assignment of workers to firms. Fox (2010b) discusses 
non parametric identification of production functions in matching games and Fox et al. (2018) 
show that unobserved heterogeneity distribution can be recovered in matching games in 
which unmatched agents are observed and agents match on many separate markets. Because 
static random utility models (including mine) do not follow agents over time, they do not 
identify the unobserved heterogeneity distribution in the fashion of Abowd et al. (1999), 
Bonhomme et al. (2019), Bonhomme (2021) and instead focus on match formation based on 
observable surplus. 

The model I develop features sorting between multidimensional types and as such is 
also related to Choné and Kramarz (2021), Lindenlaub (2017) and Lise and Postel-Vinay 
(2020). However, it is only remotely related to the search literature to which the latter 
paper belongs, as it focuses on relative supply and demand instead of search frictions. While 
the search literature often relies on Nash bargaining mechanisms, as in Shimer and Smith 
(2000) and Cahuc et al. (2006), the present model uses wage posting, as the competitive 
equilibrium in the model rests on wages that clear the labor market. Also related to this 
model and its application is the Roy model developed by Hsieh et al. (2019) to quantify the 
productivity gains of weakening discrimination barriers to women’s and black men’s entry 
into the labor market in the US. There exists an extensive literature on the education wage 
premium, mostly focused on the college wage premium in the US. Seminal work by Katz 
and Murphy (1992) shows that the increasing supply of college graduates in the 1970s and 
1980s is absorbed on the US labor market by increased demand for these workers from firms. 
Card and Lemieux (2001) carry out a similar analysis that further differentiates workers 
by age, and show that young college graduates are the first to benefit from the slowdown 
in educational attainment in the 1980s. Goldin and Katz (2008) and Autor et al. (2020), 
among others, relate changes in the US wage structure to the race between education and 


technology, by which skill-biased technological change favors college graduates. Skill-biased 
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technological change (SBTC) origins in the development of new technologies, in particular 
computers (Autor et al. (1998), Autor et al. (2003)). However, if the SBTC hypothesis has 
proven a powerful explanation for the quick increase in graduate wage premium of the 1970s 
and 1980s, it is less clear if it can rationalize the subsequent slow down of both graduate 
wage premium and graduate supply in the 1990s, when the use of computers became preva- 
lent (Card and DiNardo (2002)). The recent stagnation of the college wage premium in the 
US is also documented in several papers, and several explanations have been put forward: 
Beaudry et al. (2015) argue that the demand for cognitive skills has decreased since the 
early 2000s, pushing graduate workers down the job ladder. Valletta (2016) also emphasizes 
the role of job market polarization, i.e. the shift away from middle-skilled occupations, on 
college graduates’ wages (as opposed to postgraduates). On the contrary, Blair and Deming 
(2020) examine job vacancy data and find that demand for skills has increased since the 
Great Recession. They explain the stagnating graduate wage premium by an increase in 
the supply of new graduates after 2008. They are backed by Hershbein and Kahn (2018) 
who show that the Great Recession has accelerated skill-biased technological change. In 
Portugal, changes in the wage structure are documented by Cardoso (2004), Centeno and 
Novo (2014) Almeida et al. (2017). To the best of my knowledge, I am the first to analyze 


the implications of worker and firm sorting on the education wage premium. 


Outline. Section 2 describes the one-to-many matching model. Section 3 describes the evo- 
lution of the Portuguese high school wage premium between 1987 and 2017. Section 4 dis- 
cusses the model’s identification and estimation on Portuguese matched employer-employee 


data, and section 5 presents estimation results. Section 6 concludes. 


2 Model 


Recent administrative matched employer-employee datasets hold much more information 
than workers’ characteristics and wage. They also inform on firms’ characteristics and on 
matching, i.e. the joint distribution of workers and firms. Besides matching, the data also 
provides transfers between agents in the form of wage. Relying on this type of dataset enables 
to build a rich supply and demand framework to understand the race between education 
and technology. I build a one-to-many matching model where a single firm matches with 
several workers, who interact within the firm to produce output. Workers are compensated 
through wage, and hold specific preferences for different types of firms. Workers may also be 
unemployed. Firms maximize their profit, given their production function that is specific to 


their type and market clearing wage. Both worker and firm types are observed, and possibly 
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multidimensional. The model is an extension of Choo and Siow (2006) to a one-to-many 
framework, and existence of equilibrium rests on a large market assumption, as in Azevedo 
and Hatfield (2018) and Galichon and Salanié (2021). I model unobserved heterogeneity in 
the form of additive random utility. The social planner problem rewrites as a regularized 
optimal transport problem (Galichon (2016)), and I am therefore able to derive closed-form 


solutions for predicted matching and wage. 


2.1 Setup 


The labor market is two-sided, with workers and firms on each side. There is a contin- 
uum of workers i € J. Each worker has a type x € 4. Types are discrete and possibly 
multidimensional. There is a mass n, of workers of type x, and a finite number of types: 
#X = X. On the other side of the market, there is a large number of firms 7 € J. Each firm 
has a type y € Y. As for workers, firm types are also discrete and possibly multidimensional. 


There is a mass m, of firms of type y, and a finite number of types: #Y = Y. 


Each firm matches with a non-negative number of workers of each type, while each worker 
matches with a single firm. Let k, be the number of type x workers a firm is matched with. 
The model is scaled by factor F’, meaning that (n,m) and (F'n, F'm) are observationally 
equivalent. Hence the actual number of type x workers on the market is F'n,. Therefore k, 
must be comprised between 0 (a firm cannot hire a negative number of workers), and F'n,. 
Vector k represents the workforce employed by the firm. It is akin to a bundle of workers of 
each type: 

k = (ki,...,kx) € [0, Fini] x [0, Frnx]. 


Type x worker’s utility for being employed at type y firm within workforce k is uzyg. 
It is additive in a level of amenity a that depends both on worker and firm type, as well 
as workforce, and in wage w paid by the firm to the worker. Wage wz,, is also allowed to 


depend on worker type, firm type and workforce. 
Uryk = Aryk oe Weryk- 


Every worker also has the option to remain unemployed and obtain uz = 0. 


Similarly, the firm profit v,, is additive in production y and minus total wage bill paid 


to its workforce. x 


Vyk = Vyk — Y MiaAllisaiie: 


x=1 
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Both amenity Qz,, and Yzy, are functions of x, y, k and take their value in R. The total 


surplus from a match between a firm and a workforce is the sum of workers’ utilities and 


firm’s profit 
x 


Gy — S- Keon Obagi tr YVyks (2.1) 


=I 


where wages have canceled out because they are modelled as perfectly transferable utility. 


Some characteristics of firm and workers which play a role in match formation are un- 
observed, and therefore are not accounted for in x or y. There exists a large literature that 
deals with unobserved heterogeneity, and I build on a large subset (Choo and Siow (2006), 
Dupuy and Galichon (2014)) that uses additive random shocks to model it. I further assume 
a logit framework for the model by restraining the distribution of shocks to belong to the 
extreme value class, although as shown in Galichon and Salanié (2021) in the one-to-one 


case, identification is possible with a general class of distributions. 
Worker i experiences stochastic shock (€jy,)y,, in addition to their systematic utility: 


Uxjyk lp EG ik : 


Similarly firm j experiences stochastic shock (17;,), in addition to its systematic production: 


Vysk + ENjk- 


where € is a scaling factor for unobserved heterogeneity. I impose the following indepen- 


dence conditions on stochastic shocks. 
Assumption 2.1. Stochastic shocks satisfy the following: 


(i) For each pair of two workers i and i’, €iyx and Evy, are mutually independent and 
identically distributed. 
(ii) For each pair of two firms j and j’, nj, and nj, are mutually independent and identically 
distributed. 
(iii) For a worker i and a firm j, €iyx and nj, are mutually independent. 
(iv) €iy, is independent of Az,yk, Nn ts independent of Yyx. 


(v) (€iyk)y,n and (njx)~ are distributed as extreme value 1 (Gumbel distribution). 


A market is characterized by exogenous distributions of worker and firm types (nz)rex 


and (my)yey, as well as amenity functions (Qzy)rex, yey, production functions (yy)yey, and 
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a draw of stochastic shocks € and 7. In the next subsection, I describe workers and firms 


choices and the resulting competitive equilibrium. 


2.2 Competitive Equilibrium 


Next, I define workers and firms expected utility and profit from choosing their best 


employer and/or workforce, given wages. 


Definition 1. Type x worker’s expected indirect utility G, as a function of u and type y 


firm’s expected indirect utility Hy, as a function of v are 


G,(u,) =E ana 1 Usaie ot cen Se} and H,(vy) =E |max Lge Ney | 


Under assumption 2.1, expected utilities rewrite in closed form. 


Proposition 1. Under assumption 2.1, expected indirect utilities write 


Gz(ur) = € log ( +e Y= exp (“e)) and =H, (vy) = €log §” exp (%#) 
y ok k 


where >on = dom ++ Duke: 


Proof. In Appendix C. 


The equilibrium on a market is found when supply from workers meets demand from 


firms. Supply and demand are defined as follows: 


x 
yk 


x workers willing to match with type y firm and workforce k and Sj is the mass of type x 


Definition 2. Type x worker’s supply is a vector (S.) yk,0 where is the mass of type 


workers willing to remain unmatched. 


Type y firm’s demand is a vector (DZ), where Di is the mass of type y firms willing to 


match with workforce k. 


I model unemployment through Sj, which is determined at equilibrium. I assume no 


counterpart on the firm side: all firms must be matched to a given workforce. 


Assumption on stochastic shocks lets us express supply from worker and demand from 


firms in logit form. 


90 


Proposition 2. Under assumption (2.1), the mass of type x workers willing to supply type 


y firms in workforce k is 


: eXP (Uy 
Ee (“oyh) (2:2) 
Ls SS exp (Unyk) 
y,k 
The mass of type y firms who demand workforce k 1s 
Di (ig (2.3) 
> exp (dye) 
k 
Proof. In Appendix C. 
Note that supply S and demand D both depend on wage schedule w = (Weyl) » yb Be- 


cause both workers and firms care not only about the other side’s type, but also about the 
workforce they work with both in the systematic and stochastic parts of their utility or 
profit, wages also depend on workforce k. Therefore, two type x workers employed in two 
firms of same type y but who hire different workforce k and k’ do not receive the same wage, 
aS Wayk F Wayk in general. The model is able to generate heterogeneity in wage depending 


on firm size and workforce composition. 


In the context of one-to-many matching, supply S and demand D are measured in dif- 
ferent ‘units’: if a firm can match with several workers types, workers can only match with 
one firm type. Excess demand Z defined below gives the equivalence between worker and 


firm units. 
Definition 3. Given types x, y and workforce mass k, excess demand is defined as 


A competitive equilibrium is reached on the market when supply and demand are feasible, 
matching is incentive compatible, and excess demand is zero. The first two conditions are 
automatically filled as a byproduct of the definition of supply and demand: in proposition 2, 
workers and firms choose their optimal option. As a result, matching is incentive compatible, 


and supply and demand are feasible: 
S- yk +59 = Ne and S_ DE Sani: 
y,k 


Definition 4. An equilibrium outcome (S,D,w) satisfies Vx, y,k: Zry,(w) = 0. 
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The existence a competitive equilibrium rests on the fact that there are large numbers of 
agents on the market. To show existence, I follow a proof technique introduced in the contin- 
uum assignment problem by Gretsky et al. (1992), and already used for one-to-one matching 
markets by Galichon and Salanié (2021). The reasoning is also very close to Azevedo and 
Hatfield (2018)’s proof for competitive equilibrium existence in a large economy on a market 
of buyers and sellers with a finite set of possible trades. Bikhchandani and Ostroy (2002) 
explore a similar assignment problem but do not assume large markets and work without 


heterogeneous shocks. 


I prove existence of equilibrium in two steps. First, I show that the competitive equilib- 
rium reframes as an optimization problem on total welfare. Second, I show that this problem 
is the dual of the social planner problem, who maximizes total surplus under feasibility con- 
ditions. The social planner problem maximizes a continuous and strictly concave function 


over a compact space. As such, a unique solution exists. 
Theorem 2.1. Equilibrium payoffs obtain as solutions to the following problem: 
inf S- NeGe(Ue) + > (gtd gl By) 
A | 7 y 


(2.4) 
8.t S > ke tayk + Uyk = Oye VE, y. 


Proof. In Appendix C 


Theorem 2.2. Equilibrium matching by, = Di = “uk? Va and equilibrium Sj obtain as 


solution to the social planner problem: 


H,S0 


st S > S- kebtye + 55 = Ne (2.5) 
y k 
SS lyk = My, 
k 


Equality py, = pus is only defined when k, > 0. If k; = 0, supply Sj, is not defined 


max » > Dyk Lyk a EE( pu, n, m) 
y k 
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where E(p,n,m) is equal to 


E(p,n,m) = “Ls, Robb 1, ptt — Sn, log 2 
-Som¥ Be _ 


The solution to (2.5) exists and is unique. 


Proof. In Appendix C. 


Theorem 2.2 shows that equilibrium matching can be obtained by solving a penalized 
social planner problem, where the objective function is the difference between total expected 
surplus and an entropy term due to unobserved heterogeneity. It is reminiscent of the dis- 
crete regularized optimal transport problem (Galichon (2016)). However it differs from the 
usual transport problem in two important ways: first workers are allowed to remain un- 
matched through Sj, and second, the first marginal condition > ‘i Se helagn Oy = My not 
a condition on the marginal distribution of k, which is endogeneous, but on the marginal 


distribution of worker types. 


Solving for problem (2.2) yields the following expressions for equilibrium matching yp, 


unemployment 55 and wages w. 


Proposition 3. Equilibrium matching solves 


— ig kee — Vy + € Diy ke log G2 + € log my 
losin = = EL + 3s, hi) 
—U, + log nz 
——-  s 


log Sj = 


Equilibrium wages write 


Yyk — Anyk + Uz — Vy + € log my — € log F 


Wayk = ALES) . 
Dial pa Ke’ ((aays = Guy) = (Ug = Uz) +E log = ule fe) (2.7) 
E(1+ >), kz) 
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Where U,, Vy solve 


By — oy, kar —Vy t€ Dy, ke log( ## )+€ log m pe “ 
Sok Kn XP | E45, Fe) <2 ; +exp ( a a ) eee 
(2.8) 
By — Dog kaU2—Vyt€ Yon ke log( B®) +E logmy \ 
yn EXP E(1+9., ka) = Tgp: 


Proof. In Appendix C 


In practise, equilibrium jz, Sj and w are computed by solving for equations (2.8) using 
the Sinkhorn algorithm, also called IPFP (Iterative Proportional Fitting Procedure), that 
has been developed in the optimal transport literature (among others). In the one-to-many 


case, U, and V, can be solved for by coordinate update in the same spirit as Sinkhorn. 


2.3. Links with search and matching models in the literaure 


The model I develop is akin to Choo and Siow (2006)’s in a one-to-many instead of a 
one-to-one setting. One can view the space of workforces, instead of workers, as a side of the 
market, with firms on the other side. It is particularly striking that just like in Choo and 
Siow (2006), both equilibrium matching and wage are weighted by the number of individuals 
in the match 1+ 5°, k,. In this representation, the model almost reduces to the one-to-one 
framework, but for the specific shape of marginal conditions in (2.8), that links the matching 
over workforces and firms back to the number of workers of each type. Another difference 
with Choo and Siow (2006), Dupuy and Galichon (2022) and other frameworks that use the 
IPFP algorithm in their framework is that expected indirect surpluses U and V cannot be 
explicitly expressed through equations (2.8) because the size of every match is endogenous. 
I observe transfers as wages and can leverage them to split total match surplus between 


workers and firms, in the spirit of Dupuy and Galichon (2022). 


The model also features wage posting. In the decentralized equilibrium, firms choose 
among workforces and associated wages given their draw of random shock 7, while workers 
choose among firm types, workforces and wages given their draw of «. A salient feature of 
the model is that it generates wage dispersion for a given worker and firm type, based on 
the workforce hired by the firm. All other things equal, wage is increasing in the number 
of workers hired by the firm. This is reminiscent of search models such as Burdett and 


Mortensen (1998), although the model presented here is not a search model. 


Finally, my model is closer to Katz and Murphy (1992) and Card and Lemieux (2001) 
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than it may appear at first sight. To see this, consider two workforces k and k’, where 
ki. = kz, expect for ki = kz +t, ie. there is t more worker of type Z hired in workforce k’. 


Then firm production and type % worker’s wage satisfy: 


YVyk — YVykt = (: + ys .) Wyk — (: + S- «) Weyk! - 


At the limit, when t tends to zero (if the extra worker works very few hours for instance), we 
obtain the same intuition as with the representative firm that the marginal change in wage 


is equal to the marginal change in production (divided by the number of agents): 
uk OWayk 
ai iat kz | —©. 
e ms Oks 


Hence any change in workers’ 7 is proportional to their marginal productivity, although 


its impact is mitigated by total number of workers hired by the firm. 


3 Empirical Evidence 


3.1 Data Description 


The Quadros de Pessoal dataset offers an exhaustive snapshot of the Portuguese labor 
market every year from 1987 to 2017. It covers all employees in the private sector (except 
domestic workers), and provides information on their age and highest degree obtained, as 
well as their monthly wage and hours worked. To compute the high school wage premium 
by age, I part the worker population into two groups: those who did not graduate from 
high school, and those who did. I also categorize workers into three age groups: young 
workers (from 16 to 35 years old), middle aged workers (from 36 to 50 years old), and senior 
workers (from 51 to 68 years old). I only consider full time employees, that is, workers that 
are neither part time workers (approximately 10% of the observations) nor self-employed, 
in unpaid family care, or in other forms of employment (less than 1% of the observations). 
I compute real hourly wage as the ratio of monthly wage over monthly hours, controlling 
for inflation and clean out the lowest 1% and highest 99% hourly wage percentiles. Firms 
belong to either five sectors, or industries: primary industries (agriculture, mining, energy, 


construction), manufacturing, retail and hospitality, services, transport and communications. 


To account for unemployment, I use public yearly unemployment figures by education 
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level and age group provided by INE®°. Information on unemployment is missing between 
1987 and 1991, hence I assume the unemployment rate in these years is the same as in 1992. 
I compute the number of unemployed workers each year by education level and age group by 
combining unemployment rates and the number of observed employed workers in Quadros 


de Pessoal. In what follows, active worker refers to workers either employed or unemployed. 


3.2 Empirical facts 


The Portuguese labor market is characterized by three facts between 1987 and 1997. The 
first is the dramatic increase in the number of high school educated workers, compared to 
the number of workers who did not go to high school. The second is the decrease in high 
school wage premium, i.e. the wage gap between high school graduates and non graduates. 
The third is the change in sorting between education level on the worker side, and industry 
on the firm side: sorting intensity between high school graduates and specific industries rises 


over the period. Each of these three facts are detailed below. 


Fact 1: Education supply. Supply of high school graduates relative to non-graduates 
rises dramatically over the period, as evidenced by Figure 2.1. Relative supply is measured 
as the ratio of number of high school graduates over number of active school graduates by 
age group in each year. Because high school enrolment grows every year, young workers are 
more impacted by this growth, and their relative supply goes from .12 to 1.79 on Figure 2.1, 
meaning high school graduates have grown to be about eight times less numerous to almost 


twice as numerous as non-graduates between 1987 and 2017. 


3Found on their website 
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Figure 2.1: High school graduates versus non graduates relative supply, by age group 
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— 16-34 ~ 35-54 — 55-68 


Fact 2 : Wage premia by age group. The second fact that characterizes the Por- 
tuguese labor market is the decrease in high school wage premium. To compute high school 


wage premium by age group, I estimate the following equation by OLS: 


log Wijt = S- Lis graduate, )|Oajt + Gi + UV 5t - dit ar Uijts (2.9) 
ai€{y,m,s} 


where each individual 7 working in firm 7 at time t earns wage w;;. a; is individual 7’s 
age group: either y, m or s. IjHg graduate,) equals 1 if 7 graduated from high school, and 0 
otherwise. g;, rj, and dj, are gender, region and industry fixed effects. G1 is the yearly high 
school wage premium, differentiated by age group: it measures how much more in percentage 
a high school graduate earns compared to a non high school graduate. I allow fixed effects 


to vary over time, I estimate (2.9) separately every year. 


Figure 2.2 shows the change in estimated high school wage premium over time for each 
age group, along with 95% confidence intervals. The high school wage premium differs 
between age groups: the wage gap is much higher (between 60% and 80% over the period) 
for senior workers than for younger workers (between 40% and 20%). Figure 2.2 also shows 
that the wage premium decreases for all age groups between 1987 and 2017. The extent 
of the decrease is different depending on age however: senior workers lose only about 17 
percentage points (p.p) in high school wage premium over the period, while young workers 


lose almost 50p.p and middle ages workers lose slightly less than 30p.p. 
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Figure 2.2: Estimated high school graduate wage premium by age group 
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16-34 = 35-54 55-68 


Wage levels differ by gender, with men earning more on average than women in all educa- 
tion levels and age groups (see Figures 2.9 and 2.10 Appendix B). Yet, both men and women 
experience the trend described in Figure 2.2: the high school wage premium decreases for 


both genders, more strikingly for young men and women. 


Unlike most of the literature, Fact 2 focuses on the high school wage premium, rather 
than the college wage premium. This choice stems from the particular set up of Portugal: 
on the Portuguese labor market, a high school degree is a defining factor in a workers career, 
because it is less common than in other developed economies. For instance in 2017, 32% of 
young workers still do not hold a high school degree. However, it is important to note that 
the university wage premium, defined as the wage gap between university graduates and non 
graduates, follows a similar trend to the high school wage premium, as shown in Appendix 
B, Figure 2.11. 


Fact 3: Sorting between education levels and sector. Sorting between education 
level and industry is measured by age group as the ratio of the number of employed high 
school graduates to employed non-graduates in a sector. Sorting is stronger between high 
school graduates and sector A than sector B, if this ratio is larger in sector A than in sector 
B. Plotting sorting ratios by sector over time reveals stark differences by industry, as shown 
in figure 2.3. Most notably, the Services and Transport and Communications industries 
hire young high school graduates over non-graduates at a higher rate than the change in 
overall relative supply. As shown in fact 1, relative supply goes from .11 to 1.79 over the 


period, while the sorting ratio in these industries reaches 3.22 and 4.34 in 2017. Services and 
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transports and communications also hire proportionally more middle-aged workers, with a 


ratio of 1.61 and 1.39 in 2017, compared to a relative supply ratio of .82. 


Figure 2.3: High school - Sector sorting, by age group 
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Summary. The relative supply of high school graduates over non graduates rises for 
all age groups, and in particular among young workers. Meanwhile, the high school wage 
premium decreases in Portugal between 1987 and 2017. Its decline is particularly strong for 
young workers, between 16 and 34 years old. The rise in relative supply is not absorbed 
equally by all sectors: Services and Transports and Communications hire proportionally 
more young and middle-aged high school graduates than other sectors. This is indicative of 
strong sorting between these workers and the Services and Transports and Communications 
industry. 

Portugal is unique in that it has known a dramatic education expansion, going from 
10% of high school graduates in the labor force in 1987 to about 50% in 2017. It has also 
known deep changes in how workers sort with firms based on education level, age group, 
and the firm sector, as evidenced in Fact 2. As such, it is an ideal laboratory to understand 
how sorting between workers and firms drives the high school wage premium over time. 
Changes in sorting can be caused either by an increase in relative productivity of high school 
graduates in some industries, a change in preferences of young high graduates, or changes 
in substitution patterns among education levels or age groups. Meanwhile, the increase 
in relative supply of high school graduates likely drives the wage premium down. The 
wider economic interpretation could go in two different ways: a trade effect or a technology 


effect. Indeed, Portugal entered the European Union in 1986, which lowered barriers to 
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trade with other EU countries. Because Portugal has relatively more uneducated workers 
than the countries it trades with, a Heckscher-Olhin model of trade predicts it will start 
exporting more goods which require uneducated labor, which would increase firms’ demand 
for uneducated workers. The technology effect postulates that, like other Western countries 
over the period, Portugal has experienced skill-biased technological change, which instead 
would increase demand for educated workers. The decreasing wage premium would then be 
explained by the relative increase in educated labor supply, which outbalances the rise in 
demand. To tell apart these two interpretations, I parametrize in the next section the model 
presented in section 2 to untangle the effect of changes in relative supply from changes in 


firm production and worker preferences, and evaluate their impact on sorting and wage. 


4 Identification and Estimation 


Rearranging equations (2.6) and (2.7) (see appendix C), we obtain that amenities a and 


production y verify the following equations: 


Ne 
Agyk = Uz — Wayk + € log Lyx — log os 


Yyk = Vy or Wayk 25 E log Lyk — log My. 


(2.10) 


Hence Qz,, and yy, are identified up functions U, and V,. Inspection of (2.6) and (2.7) 
shows a model generated by az, + a, and U, — a, for any a;, would be observationally 
(i.e. matching and wage would be the same) equivalent to a model generated by a,,, and 
U,, if it was not for the fact that unemployed workers are accounted for, and that their 
amenities are assumed to be 0. Because U, and U, + a; do not generate the same share 
of unemployed workers, a, is identified from observing the share of unemployed workers. 
Single firms (who would not employ anyone) are not observed however, so that two models 
generated by yy, and V, or yyx + by and V, — b, are observationally equivalent. ‘There- 
fore, I set any variable that varies in firm type y but is constant across workforce k to zero 


in firm production by assuming a Constant Elasticity of Substitution in the parametrization. 


The model’s predictions on matching (2.6) and wage (2.7) allow to separately identify 
amenity and productivity functions (azy),,, and (yy), This would not be true if we observed 
only matching, as a and y appear together in the matching prediction, and only total surplus 
can be identified from this equation. If only wages were observed, the same problem arises 
and only the difference between firm production and worker amenities is identified. In this 


case one must assume that amenities are zero in order to identify production. 
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Note that the model identification does not rest on observation of demand or supply 
shifters. Instead, it uses variation in matches within agents of the same type. As a though 
experiment, imagine there would only be one type of workers. Then under the assumptions 
made on unobserved heterogeneity, the number of workers matched with a type y firm rela- 
tive to how many are matched with a type y’ firm informs the model on the exact amenity 
difference perceived by workers between firms of type y and firms of type y’. Because I as- 
sumed that unmatched workers perceive no amenity, these differences translate into amounts 
of amenities, that are always relative to the unmatched worker’s, as is usual in this type of 


logit models. 


In any given period t, I aim at parametrically estimate at and y'. All amenity and pro- 
duction parameters are allowed to vary with time, and in what follows I drop the superscript 
t to ease the exposition. I assume N = 6 worker types that are the combination of two 
education levels, and three age groups. The education levels are high school graduates H 
and non graduates L, and the age groups are young y (below 35), middle-aged m (between 
35 and 54), and senior s (above 55). Let e(x), a(x) be type x’s education level and age 


group. Firm workforce k is composed of the numbers of each worker type employed 
k= (harap Risa Ku,s, Ries Kiwis Rie) : 


Employed number of worker k, is directly observed in the data and defined as total number 
of hours worked monthly by workers of type x hired by the firm, divided by 174, the monthly 
hours equivalent of a 40 hours week. Hence each k, counts the full-time equivalent of the 
number of type x workers employed by the firm. This measure is not necessarily an integer, as 
part-time workers would count as fractions of the full-time equivalent. Type y firm produces 
according to a nested Constant Elasticity of Substitution (CES) production function with 
different parameters depending on its type y: 


y 


sue = [OR H(R) = + CLLR) |] - tao 


where aggregates H(t) and L(t) are: 


H(k)=| SD Aaah and L(k) =| > danke 


ac{y,m,s} ac{y,m,s} 
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Production 7”’s outer nest involves three parameters: o, 67,, 07 and two aggregate inputs 
H(k) and L(k). o € (0,00) is the elasticity of substitution between education levels, it is 
greater than one if high school graduates and non graduates are gross substitutes, and smaller 
than one if they are gross complements. o is assumed to be the same across firm types. 04, 
67 € [0,00) are graduates and non graduate’s productivity parameters. Both parameters 
may vary by firm type y. In addition to their CES production function, firms experience 


friction we if they employ workers of type x. The rationale is that if there are few workers 


of type x, then it is costly for the firm to find and hire them. v¥ measures this cost by sector y. 


The CES parametrization is the last piece needed for complete identification of produc- 
tion y: indeed, there does not exist a constant b, such that 7, and y,,+, are observationally 
equivalent across all workforces k under the CES assumption. This is true up to search fric- 


. 7] 
tion >>, 1{%.,>0];-, hence search costs v¥ are not comparable across sectors. 


Aggregate labor inputs H(k) and L(k) form the production function’s inner nest. They 
each depend on four parameters: three age productivity parameters each: My and ve € 
[(0, co) and one elasticity of substitution between age levels each: 7” and rt” € (0,00). Elas- 
ticities vary by education level but are the same across firm types, while age productivity 


vary with firm type y. 


The production function is close to the one used by Katz and Murphy (1992), and Card 
and Lemieux (2001): it assumes imperfect substitution and varying productivity in the 
tasks performed by different education levels and age categories. Capital is not included as 
an input, but may impact productivity parameters through firm type: if two firm types use 
different levels of capital in relation to education levels, it is reflected in the levels of 0%, 
and 6%. Unbiased technological change that increases all workers productivity results in an 
increase in both 67, and 67. Technological change may be biased towards an education level 
if its productivity increases faster than the other’s. This production function also allows 
more flexibility than Card and Lemieux (2001) by letting elasticities of substitution and age 


productivity vary in time. 
Production assumes constant returns to scale. Note that it is homogeneous of degree 


one, and therefore two functions parametrized with 6 and A or c x # and A are equivalent. 


To distinguish between these versions, I impose normalization condition: 
> Mee) MH 1 wy. (2.11) 
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I assume worker amenities are constant in k: 
eh =e (2.12) 


G4 reflects type x worker preferences for type y firms over other firm types. In particular I 


assume workers are indifferent to workforce size. 


Given these functional forms, I am looking to estimate in every period t parameters 
(bn) ON 7). (Finy Ory» (22) oy» TH TL and o. To this aim I use a maximum likelihood 


method, which I describe in what follows. 


The model predicts matching jy, as a joint distribution on firms and workforces, which 
can be compared to observed matching /1,x, which is simply the number of firms matched 
with workforces k in the data. Let also Se be the number of unemployed worker of type x. 
Let w;; be the observed wage of worker 7 employed by firm j. Observed wage w,; is assumed 
to be a noisy measure of predicted wage Wz,y,4, where kj is the entire workforce employed 


at firm j. In other words: 
Wij = Weyjk; + Vig Where viz ~ N (0, s*) iid, (2.13) 


where v;; is a centered measurement error of variance s*. Under assumption (2.13), 


observed average wage Woyk for type x workers hired by firm y in workforce k is distributed 


. 1 ane) ee 
Weoyk = K, ys Wasyjky ~ N (0. e) iid, (2.14) 


yk taq=a 
DYI=Y 


as 


where Toe is the total number of type « workers hired by firm y in workforce k in the 
data: hog = k,/ty,. Because there is a very large number of observed wages in the data (as 
many as there are workers), I choose to work with observed average wages by worker type, 
firm type and workforce in the likelihood estimation. This reduces the likelihood function 
complexity but does not limit estimation: the model parameters as well as variance s? can 


still be recovered from log likelihood maximization. 


Let fy, (0, 8, n,m) and Wryx(T, B,n,m) be the matching and wage predicted by the model, 


given parameters T = ((6%,)y, (%)y, (Aja)a; (Az,a)a; TH, TL, 7), 8, and worker and firm type 
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distributions n = (nz), m = (my),. The log likelihood of observing pair (a, y, k, W) is then 


Ke flyk log Hyx(V, B,n,m) 


ag (Wi zyk Wp l yom) 1 s° 
—K, —.] = ; 
yk 52 9 °8 


xyk 


Meanwhile, the log likelihood of observing an unemployed worker of type x is 
S$ log S¢(I’, 8, n,m, 8”). 
The log likelihood method therefore solves 


max l(I', 8,n, m, s”) 


T’,6,s2 
= max > 7 hf log yn(T, B.,m, 5°) + J S$ log $5 (0, 8,n,m, 5?) 
a yk x (2.15) 
Way — mall yea fs 
~Y Ka! yk — W aie OM ime )) ~ log = 
25? 2 K. k 
x yk xy 


T run log likelihood estimation on ten separate three year periods between 1987 and 20177. 
Years in each period are pooled. In each period, I observe number of workers and firms (x). 
and (my), directly in the data. I normalize without loss of generality the total mass of firms 


in each period to 1, so that scaling factor F' is a My, and input nz = 4 and my = oe to 


likelihood estimation. 


I solve numerically for problem (2.15) using a nested method: in the inner loop, (6, A, 7, 0, 3), 
5S§(8,A,7,0, 8) and Way.(9, A, 7,0, 8) are computed according to (2.6) and (2.7). Scaling fac- 
tor € is set to 1. In the outer loop, I update (@,A,7,0, 3) using Adam, a gradient descent 
method with momentum (Goodfellow et al. (2016), Kingma and Ba (2017)). Variance s? is 


obtained in the outer loop through first order condition: 
2 1 = 9 2 
SS W a > gut (Weve —_ teal, B, n, m)) 
ac yk 


More details on estimation can be found in appendix D. 


4Periods are 1987-1989, 1991-1993, 1994-1996, 1997-1999, 2000-2003, 2004-2006, 2007-2009, 2010-2012, 
2013-2015, 2016-2017. Since data for years 1990 and 2001 are missing, the last time period spans only two 
years. 
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5 Results 


5.1 Parameters estimates 


Estimates for high school graduates and non-graduates productivities 67, and 0% by indus- 
try y are presented in figure 2.4. Estimates shows education productivity are heterogeneous 
by industries, and have evolved in non-linearly: high school graduates productivity dis- 
plays an impressive surge starting in 2010, especially in the Transport & Communications, 
Manufacturing and Services industries. Non-graduates productivity drops for all industries 
between 2010 and 2013. As a result, high school graduates’ productivity relative to non- 


graduates rises at the end of the period. 


Figure 2.4: Estimated education productivities 
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95% confidence interval 


The spectacular increase in educated workers’ productivity that starts between 2007 and 
2010 and stabilizes after 2013 coincides with the sovereign debt crisis in Portugal, which may 
give some clues as to why the increase is so large. Indeed, in an attempt to curb unemploy- 
ment and stimulate the labor market, Portuguese labor institutions have been modified on 
several levels: minimum wage, which had steadily increased in the previous years, was frozen, 
severance pay was lowered, and an attempt was made to revise the bargaining of wages at 
the sectoral level, although this attempt met with a lot of resistance and never entirely 
went through. The first two changes might have had an impact on worker productivity as 
measured by the model on their own however: because estimated productivity is positively 


tied to wage, the yearly increases in minimum wage have driven the increase in uneducated 
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workers productivity before 2010. When minimum wage froze, this driver of growth did too. 
Second, the lowering of severance pay seems to have accelerated the replacement of unedu- 
cated workers by educated workers in firms: in 2007, 48.8% of firms employed at least one 
educated worker. In 2010, this percentage was up to 53.0% and in 2013 to 57.7%. Hence, 
the matching distribution has drastically changed in these years, which is likely driving the 
rise in educated workers’ productivity, and contraction in uneducated workers’ productivity. 
Under this interpretation, the sudden changes in trend observed in 2010 reflect an overdue 
adjustment of matching on the labor market, made possible by the change in labor market 


institution. 


Figure 2.5 shows the evolution of age productivities Ay, and Ay, by age group and 
over time. Because A774 and Az,q Sum to one over age groups in any given year, they can be 
interpreted as shares of each age group in total labor input by high school graduates and non 
graduates. Estimates A;, are fairly stable over the period up until 2010-2013, with middle- 
aged workers making up most of the labor input for non high school graduates. Their share 
in labor input increases to an even higher level (about 75%) in 2010-2013. Estimates Aq,y 
and Ax m increase steadily until the early 2000s, but high school graduates senior workers 


input remains the most productive of the three at the end of the period. 


Figure 2.5: Estimated age productivities 


Below High School Above High School 
1.00 1.00 
0.75 0.75 
F050 50.50 
0.25 0.25 
0.00 0.00 
1990 2000 2010 1990 2000 2010 
Year Year 


* 16-34 ~ 35-54 = 55-68 


95% confidence interval 


Figure 2.6 presents the change in worker preferences for firms $¥ in euros per hours 
worked. All education levels and age groups hold high preferences for Retail & Hospitality 


over the period, and low preference for Transport & Communications. High school graduates’ 
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preference for Services increases over the period, while their preference for Manufacturing 


decreases. 
Figure 2.6: Estimated worker preferences 
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Finally, figure 2.7 presents estimated elasticities of substitution between education level 
og, and age groups Ty and Ty. Tz, the elasticity of substitution between non graduates age 
groups is generally very high, suggesting age groups are perfect substitutes. o is between 
1.68 in 1987-1989 and increases monotonically to 37.67 in 2016-2017. Ty is between 2.68 in 
1987-1989 and 26.50 in 2016-2017. 


Figure 2.7: Estimated elasticities of substitution 
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These findings make sense with regards to my estimation method, which unlike most of 
the literature does not postulate a representative firm by industry, but instead estimates 
elasticities of substitution at the firm level. Because I account for all firms in the economy 
and fit the matching distribution, if a large share of firms employs workers of a single educa- 
tion level, either below or above high school, it would be unsurprising to obtain a estimate 
for o that suggests that education levels are perfect substitutes (i.e. a very large 0). The 
same reasoning is valid for substitution between age groups. Scanning the sample for such 
patterns reveals that a majority of firms employ workers of a single education level: they 
amount to 78.5% of firms in 1987 and 63.1% in 2017, as well as a single age group: firms who 
employ a single high school graduates age group amount to 72.8% of firms who employ high 
school graduates at all in 1987 and 58.2% in 2017. For non graduates, the proportion is be- 
tween 44.0% in 1987 and 52.0% in 2017. Perfect substitutability of worker types is consistent 
with the view that the production function at the individual firm level is linear in labor in- 


puts, which is an assumption that had been made in the literature (Hellerstein et al. (1999)). 


Discussion. Takeaways from the structural estimates presented in this section are three- 
fold. First, high school graduates productivity has surged over the period. This observation 
is strongly consistent with a hypothesis of skill-biased technological change, i.e. an increase in 
worker productivity that favors educated workers, rather than with the competing Hecksher- 
Ohlin trade hypothesis, which predicts an strengthening of firms’ demand for uneducated 
workers. Second, young and middle-aged high school graduates’s share in productivity has 
increased over the period, which suggests the decreasing high school wage premium for these 
age groups cannot be explained through an increased demand for experience. Third, workers 
hold heterogeneous preferences towards sectors. Amenities perceived in the Transport & 
Communications and Services sector are below zero for most of the period, which puts an 
upward pressure on wage in these sectors. 

These observations must be interpreted in the light of the institutional changes that have 
occurred in Portugal over the period: the Portuguese labor market is characterized by a 
steadily (if slowly) increasing minimum wage: in nominal terms, hourly minimum wage is 
2.05 euros in 1999 and reaches 3.73 euros in 2017. Most Portuguese workers are also covered 
by collective bargaining agreements. Finally, it is costly for a firm to fire a worker, because of 
generous severance packages. Between 2011 and 2014, Portugal has implemented a number 
of reforms on its labor market: minimum wage was frozen (until the end of 2014), the scope 
of collective bargaining restricted and terminating workers made less costly. These reforms 
coincide with a break of trend in the estimated education productivities, and may impact 


them in two ways. First, the freeze in minimum wage may be partly responsible for the 
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fact that high school non-graduates productivities stop increasing after 2010: because the 
minimum wage is binding for a large number of non-graduates, its freeze must reverberate 
on non-graduates’ productivity, since the model predicts it increases wage. However, the 
minimum wage freeze on its own cannot account for the estimated drop in non-graduates 
productivity, nor the surge in graduates productivity. Both of these appear to be driven by 
matching, as the number of firms which employ only high school graduates increases rapidly 
between 2007 and 2013. In light of the institutional changes that have taken place over 
these five years, changes in matching have been made easier by lowering workers’ severance 
package and reducing the scope of collective bargaining. Hence an interpretation for the 
surge in graduates productivity is an overdue increase in firm demand that was kept low 
before 2010-2013 not because of a low productivity, but because of stringent labor market 


institutions. 


5.2 Model Predictions 


Table 2.1 compares the slopes of observed and predicted sorting over time. Slopes are 
obtained by fitting a time trend to the log of relative education supply in each age group and 
industry. They can be interpreted as average increase in sorting strength (measured as change 
in relative supply within an industry) over the period: for instance relative supply increases 
by on average 127.4% every period in the 16-34 age groups and the primary industries. Model 
predictions fit the manage to fit the changes in the data quite well, especially for the 16-34 
and 35-54 age groups. 


Table 2.1: Sorting average yearly percentage growth - Observed versus Predicted 


16-34 | 35-54 | 55-68 
Industry Data Prediction | Data Prediction | Data Prediction 
Agr., Mining, Energy, Constr. | 1.274 1.037 0.976 0.876 1.178 0.838 
Manufacturing 1.478 1.881 1.222 1.774 0.928 1.341 
Retail, Hospitality 1.16 1.274 1.315 1.262 0.933 0.909 
Transports, Communication 1.391 1.993 1.748 2.248 1.454 1.947 
Services 1.03 1.565 1.05 1.273 0.625 0.728 
Overall 0.279 0.25 0.226 0.233 0.181 0.253 


Figure 2.8 compares observed and predicted average wage by education level and age 
group over time. Model predictions match the slope of average wages for almost all education 
levels and age groups, except for senior high school graduates. Average wage for this worker 


type is over-estimated by the model. This is likely due to the importance of collective 
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bargaining in Portugal, which presumably tightens the wage distribution and is not accounted 
for by the model. 


Figure 2.8: Average wage by education level and age group - Observed versus Predicted 
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5.3 Counterfactuals 


There are four categories of inputs that determine optimal matching and wage and that 
change over time: the number of workers of each type, the number of firms in each sector, 
production function parameters and worker preferences parameters. The first two are ob- 
served directly in the data and the last two are estimated. In the counterfactuals exercises 
that follow, I vary each one of the four inputs, holding all other three fixed between 1987- 
1989 and 2016-2017. The first counterfactual keeps the shares of each sector, production 
parameters and worker preferences constant to their 1987-1989 levels but lets the worker 
demography, both in terms of age group and education level, vary as it has in the data be- 
tween 1987-1989 and 2016-2017. The second counterfactuals holds production parameters, 
worker preferences and worker demography fixed but lets sector shares vary. The third and 


fourth counterfactuals vary only production parameters and worker preferences, respectively. 


The two object of interests are education-sector sorting and high school wage premium. 
The model makes predictions on both of these through equilibrium jz and w. Sorting between 
education and sector is defined as the ratio of high school graduates over non-graduates 


employed in a sector y, for a given age group a in a given period f: 


= ok kiabtyr (I, og n', m') 
> Rr abyr(T, Gta) 
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r(I", BY,n',m') 


where ju is the predicted matching. Therefore the change in sorting between two periods 


t and s is eee 
ee Tya(l°, eh n, m*) 
yt Py all’, Be nt, m') : 


Let t=1987-1989 and s=2016-2017. Then the counterfactual change from labor supply is 


lB = fal. ae) 
ya Tiga De) 


where ryq(I", 6',n*,m‘) is the counterfactual sorting if only labor supply n evolves to its 
2016-2017 level, while all other factors , 6 and m stay at their 1987-1989 levels. 


Similarly, define wage premium for age group a in a given period f: 


wo gt ni m') -_ yi kmail’, can ie io les aeer @ ae rcs m') | 
_~ ok Ke ofall”, py n', m')weratyk(1", B Ww m') 


so that counterfactual change in wage premium from labor supply is 


WLP = Wy,a(I", ee n, m') 
y,a Bal’, Bt, nt, m*)- 


The next tables 2.2 and 2.3 show the predicted changes in sorting and wage premium 
along with the four counterfactual scenarios. Note that because of non-linearities, the sum 


of changes in all four counterfactuals does not sum to the predicted change. 


Table 2.2 shows changes in relative employment and the increasing presence of high 
school graduates in the Manufacturing, Services and Transport & Communications sectors 
are mainly driven by labor supply: the rise in educated workers’ share mechanically in- 
creases their employment share in each industry. The counterfactual increase in sorting is 
uniform across industries however, while predicted sorting is not. An important driver of 
the heterogeneous increase in industries appear to be the evolution of production parameters 
and worker preferences: production parameters have a particularly strong positive impact 
on sorting in Manufacturing and Transport & Communications sectors, while worker prefer- 
ences drive sorting in Transport & Communications and Services to lower levels than other 


sectors. 


Ltd 


Table 2.2: Changes in Sorting - Predicted versus Counterfactuals 


dues | 1987-2017 | Labor Industry Production Worker 
change supply | composition parameters preferences 
Agr., Mining, Energy, Constr. oto 9.4 0.87 0.87 0.77 
Manufacturing 22.88 8.01 1.01 1.19 0.76 
Retail, Hospitality 8.94 8.51 0.92 0.74 0.85 
Transports, Communication 43.33 7.04 0.9 4.12 0.55 
Services 14.59 6.4 0.79 0.73 0.61 


Interpretation: Predicted relative employment of high school graduates to non-graduates 


is multiplied by 3.73 between 1987 and 2017 in Agr., Mining, Energy, Constr. 


Table 2.3 shows how different scenarios impact changes in wage premium. Consistent 
with the data, the model predicts a decline in wage premium for all age groups, and es- 
pecially young workers. Yet each factor except the change in industry composition has a 
heterogeneous effect on wage premium depending on age group. First labor demographics 
drive young and middle-aged workers wage premium down. Surprisingly, the same is not 
true of senior high school graduates: if only labor supply changes over the period, the senior 
workers’ counterfactual wage premium increases. This is because their supply increase, but 
so does the supply of senior non-graduates, especially relative to younger non-graduates, so 
that the change in senior worker wage premium is actually positively impacted by labor de- 
mographics. The evolution of production parameters has a positive effect on young workers’ 
wage premium, but a negative effect on other age groups, likely because age productivity 
of middle-aged non-graduates increases and age productivity of senior graduates decreases 
over the period. Finally workers preferences have a strong, positive impact on all age groups 
wage premia. Appendix E shows the details of this impact by industry. For young and 
middle-aged workers, it appears to be mainly driven by the high school wage premium in 


the Retail & Hospitality and Services sectors. 
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Table 2.3: Changes in Wage Premium - Predicted versus Counterfactuals 


1987-2017 Labor Industry Production Worker 
Age group Las 

change supply | composition parameters preferences 
16-34 -0.28 -1.09 -0.7 0.19 3.67 
35-54 -0.18 -0.85 -0.72 -0.28 2.29 
55-68 -0.01 6.66 -2.14 -1.31 LOY 


Interpretation: the predicted wage premium for the 16-34 age group has fallen 
by 28% between 1987 and 2017 


The main takeaway from Table 2.3 is that demographic change, through its impact on 
labor supply, is the main driver over the decreasing wage premium for young and middle- 
aged workers. Since estimated parameters show a rise in high school graduates productivity, 
it seems the prevailing interpretation is that although skill-biased technological change took 
place over the 1987-2017 period in Portugal, it has been outbalanced by the formidable 
increase in the relative numbers of high school graduates. This conclusion must be nuanced 
in the case of senior workers however: their wage premium is mainly dragged down by the 
changes in industry composition, i.e. structural change, and the evolution of production 
parameters, as the relative productivity of senior high school graduates with respect to 
young and middle-aged high school graduates declines, while for non graduates it remains 


constant. 


6 Conclusion 


This paper studies wage inequality in Portugal between 1987 and 2017, and seeks to ex- 
plain the decreasing high school wage premium over the period. The decrease in high school 
wage premium is particularly stark among young workers, and it is accompanied by a rapidly 
rising employment share of young educated workers in specific sectors such as Transport & 
Communications and Services. Over the period, Portugal has experienced a surge in its sup- 
ply of high school educated workers, as well as sweeping changes in its industry composition, 
as Services have replaced Manufacturing as the first employer in the country. The increase in 
educated workers’ employment share in the aforementioned sectors suggests a productivity 
boost in these sectors consistent with skill-biased technological change, but the decreasing 
high school wage premium observed could also be consistent with a Hecksher-Ohlin theory, 


whereby less educated workers see their wage rise as they start being more demanded when 
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Portugal joins the European Union. 


To jointly explain changes in sorting between workers and firms, and the decreasing wage 
premium on the Portuguese labor market, I build a static model of one-to-many matching 
with transferable utility. Using predictions for both wages and joint distribution of firms and 
workforces, I am able to separately estimate worker preferences for firms and parameters for 
firms’ nested CES production functions. Estimates show high school graduates productivity 
has increased in all sectors, consistent with a theory of skill-biased technological change. 
Counterfactuals suggest changes in sorting are driven by heterogeneity in sectors’ relative 
demand over time, as well as changes in workers’ preferences. They also suggest the decreas- 
ing high school wage premium is driven mainly by an increase in the relative supply of high 


school graduates to non-graduates. 
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A Data 


I use Quadros de Pessoal, a matched employer-employee dataset provided by the Por- 
tuguese National Institute (Instituto Nacional de Estatistica, INE). Quadros de Pessoal is 
issued yearly from 1987 to 2017, based on firms declarations on their characteristics and 


their employees’. Both workers and firms are identified across time by a unique identifier. 


I use information on firm industry, worker’s age and education level Industries are pro- 
vided as “economic activity”, up to 3 digit level. Because of classification changes at the 
2 and 3 digits level over time, I use the one digit level classification, to keep consistency 
over the years. I exclude firms whose economic activity at the 1 digit level are unknown. 
Worker education is provided as a 3 digits classification, out of which I aggregate 9 levels: 
no schooling, primary schooling 1 (up to 10 years old), primary schooling 2 (up to 13 years 
old), primary schooling 3 (up to 15 years old), completed high school, some higher education, 
bachelor, masters and PhD. Worker age is used directly without further cleaning. I exclude 
from the sample any worker whose education level of age is unknown (3.9% of observations 


per year on average) 


I also use information on wages and number of hours worked. Wage is provided as a 
average monthly earnings, that accounts for bonuses and extra hours earnings. Number of 
hours is provided as the baseline number of hours in the contract, plus any extra hours 
worked (averaged overt he year). I consider the sum of base and extra hours as my measure 
for number of hours worked per month. I divide monthly wage by monthly hours to obtain 
a measure of hourly wage, and deflate it. Real hourly wage is my final measure of wage. I 
exclude from the sample any worker who has worked zero hours or earned zero wage over the 
year (11.5% of observations per year on average). These are mainly, in my understanding, 
workers on sick leave, maternity leave, or sabbatical that do not contribute to firm produc- 
tion in that year. I also exclude from the sample any workers who are strictly under 16 or 


above 68 (the retirement age in Portugal) 


Additionally, I exclude any observation with a missing or 0 worker ID (3.5% of observa- 
tions per year on average). I am also faced with an issue of duplicate worker IDs which, even 
though it is minor in the sample later years (about 4.8% of observations per year on average 
from 2007 to 2017, including 0 IDs), it is much more serious in the earlier years (about 19% 
of the sample in 1987, including 0 IDs). I suspect these to be encoding mistakes that relate to 


actual different workers. Some can also be workers who hold two different jobs (for instance 
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an employee somewhere who also have a self-employed activity). Because I do not use the 
panel aspect of the data, and therefore encoding mistakes in workers ID are not a problem 
in my analysis, I keep most duplicates, only removing observations who appear more than 
5 times in any given year (an average 6.1% of observations per year, less than 1% of the 
dataset starting in 2007). I also exclude from the sample any worker who is self-unemployed, 
in unpaid family care, or labelled under “other” employment contract (7.1% of observations 
per year on average). The rationale behind not considering self-employed is that many of 
self-employed workers actually work as consultants for a firm, with no way to link them. 


Self-employed workers on their own represent about 1% of the dataset. 


B_ Details on Empirical Facts 
B.1 Wage levels by gender 


Figure 2.9: Average wage by education level and age group - Men 
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Figure 2.10: Average wage by education level and age group - Women 
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B.2 The university wage premium 


The university wage premium is obtained through regressing the following equation: 


log Wijt = S L university graduate,| Cag + Git Vit oF dit alr Uijzt- 
aiE{y,m,s} 


Figure 2.11: University wage premium by age group 
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C Proofs 


Proposition 1 


Proof. Let Z, = maxy~ {Uxyk + €€iyx} and Za = max, {vyx + €njx}. The proof consists is 
showing that Z; follows a Gumbel distribution with expectation € log >7,, }7), exP (4 zyk ) and 


Z follows a Gumbel distribution with expectation € log 5°, exp (2), 
C — Unyk 
P [Z1 < c| =P oo < j Vy + 


“Pins 2 


_ _ yk — € 
Ter(-(%) 
= log P27 c= - Dew ( =*) 


( —c + log > 7,4, XP (Uzyr) ) 
= — exp é . 


And a similar reasoning shows: 


et ae e& ee Ee (ea) | 


Hence up to the Euler-Mascheroni constant, Z, follows a Gumbel distribution with ex- 
pectation € log a se (“3*) and Z, follows a Gumbel distribution with expectation 


E log 5°, exp (4), 


Proposition 2 


Proof. Following McFadden (1974), Choo and Siow (2006), the probability that worker x 


chooses option ¥, k is 


P ly, k= arg MAX Uryk a bys | =P [€€yx < Ugg.k — Uayk =e beak Vy, k] 


[Tew (- exp (“a ee = *)) exp(—e) exp (— exp(—e)) de 
y,k 
on() 


Lt ie Pe) 


A similar derivation applied on the firm side. 
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Theorem 2.1 Based on Gretsky et al. (1992) and Galichon and Salanié (2021). 


Proof. Consider the following problem over the sum of worker welfare ie u,; di and firm welfare 


J; dj: 
int fucai+ f oa 
uv J; j 


ke ker 
s.t S- S- Ui + V; > Gyr tE>_ S- €iysk + ENE VRe9 (2.16) 


x taz=2 L a4=x 
ui > Ee. 


Take any two u,v such that )>) kzteyx + Uyr > Pyx and Uso = 0 and define 


Ui = MAK ic Wasik + Cee} 
Uj = MAaxK {Vy k + ENjK- 


Then (u,v) satisfies (2.16)’s constraints. 
Reciprocally, fix any u;,v; that satisfy the constraints in this problem and define 
Let 
Unank = My ¢.251 ty = Cay, and Ug = 0 
Vyk = MANjy,-y{0j — ENjr}- 


Then the constraint in problem (2.16) becomes )>., krUsyk + Vyk > Byx- 


Applying the law of large numbers, we get that (2.16) is equivalent to 
min S- TiegGiz(Ug) + S- My Hy (vy) 
a y 
st So ketleyk + Uye = Oy, Why (2.17) 
Ls = 0. 


By complementary slackness condition, solving problem (2.16) with Uryx = Ory + Weyk 
and Vy = Yyk — doy KxWayk Yields equilibrium wage. Equilibrium supply and demand yk = 


k,,D{ obtain as the Lagrange multiplier ju,, on constraint S> keUsyx + Vyr > Pye. 


Proof. Theorem 2.2 
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Rewrite problem (2.4) as saddle-point: 


min max NyGy(uz) + m,H,(v 
ninmanc )maCa(t) + myH(o) 


+ Lyk (%. = > fe lia = va) = ye) So (—uz0) 
yk z % 


=max 5 [byk® ky 
LU 
yk 


ke So 
= Sone max »» ID ie + Uso — cn] 
: u ; Ney Ny 
Lyk H. 
— Syn LS Hay — 1) 
; y 


y 
= a S- [ykP ky dk 
y,k 
ky [yk Pree); Hyk 
—E| DD Rettynlog = — Sh log > — D thy log —* J , 
x yk - x %, yk y 


where the last line is obtained through solving for G and H’s convex conjugates: 


x 


Ks Sé 
C= max 2 yk = Ur0 — c.(n| 


yk 
* Lyk 
Ay (u) = max » mm, vo — aot . 
yk 


For which first order conditions are 


Unyk Vyk 
Reedlife = exp (=) and Lyk = as (4) 


Which ensures that ju is feasible, i.e. satisfies marginal conditions, otherwise the value of the 


social planner problem is +00. 


Problem (2.5)’s objective function is strictly concave and the maximization set defined 


by the marginal conditions (2.8) is compact. Therefore the maximum exists and is unique. 


124 


D_ Estimation 


This section covers the details of log likelihood estimation. Subsection D.1 describes 
how to data is processed into observed matchings and wages by firm type and workforce, 
subsection D.2 presents the gradient descent algorithm used for log likelihood maximization 


and subsection D.3 provides details on log likelihood gradient computation. 


D.1 Workforce Discretization 


From Quadros de Pessoal, | build a workforce matched to each firm every year, by using 
firm identifiers provided in the data. I weigh workers by their number of monthly hours 
worked on average over ear, which is directly provided in the dataset. One full-time worker 
is equivalent to 174 hours worked per month (which is a 40 hours week). If for instance a 
worker has worked 180 hours per month, she counts as is = 1.03 full-time workers. The 
distribution of firms by number of high school graduates and non graduates employed is 
plotted in figure 2.12 in the periods 1987-1989 and 2016-2017. A firm is defined through 
a distinct firm identifier-year combination. 2.12 shows a large majority of firms are small 
firms. Many firms employ no high school graduates, especially at the start of the period: 
they represent 75.9% of firms in 1987-1989, and 37.0% of firms in 2016-2017. In contrast, 
firms who do not employ no high school graduates make up 2.6% and 26.4% of all firms, in 
1987-1989 and 2016-2017 respectively. Firms who employ more than a thousand of workers 
at one education level are excluded from the graph, but not from the estimation. They 
represent 234 firms in 1987-1989 and 206 firms in 2016-2017. 


Figure 2.12: Firm distribution by number of high school graduates and non graduates 
employed 
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Performing the estimation requires to compute observed matching ji,, and observed av- 
erage wage Woyk by workforce k. The number of different observed workforces in the data 
is very large: there are 96612 combinations in 1987-1989 and 69314 in 2016-2017. Max like- 
lihood computation requires to evaluate ty, and wz, on all observed workforces. To speed 
up the max likelihood computation, I cluster observed workforces into a smaller number of 
representative workforces. To do so, choose a number of bins B. For each worker type x, 
split the interval between 0 and k/”°* in B smaller intervals, where k/”"** is the largest ob- 
served number of type x workers employed by a firm. For each worker type x, the procedure 
yields B intervals, or clusters [0,k1),...,[k2-1,k®),...,[k?, k™4"]. Each observed number of 
worker x employed by a firm falls into one of these intervals. I assign each observed number 


: . po pet 
to a cluster. The representative number of workers for each cluster is “—*—. 


In the baseline estimation, B = 15. Intervals are split according to a logarithmic scale. 
The number of observed clusters is reduced to 10359 in 1987-1989 and 21871 in 2016-2017. 


As an illustration, figure 2.13 displays worker distribution across firms by type, and the 


clustering of workforce. 


Figure 2.13: Worker type distribution and clusters, 1987-1989 
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D.2 Adam Algorithm 


Adam is a first-order gradient-based optimization algorithm. It belong to the family of 
algorithms with adaptive learning rates. Their main benefit is speed: they use information 
given by the gradient to modify their learning rate, and hence improve convergence speed. 


In particular, Adam uses momentum, i.e. an exponentially moving average of past gradients, 
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at each iteration. It also uses bias correction. Adam was first introduced by Kingma and 
Ba (2017). For a general presentation of the algorithm, see Goodfellow et al. (2016). The 


algorithm applied to the present problem goes as follows: 


Set decay rates p) = .9, po = .999, step « = le — 2, stabilizer 6 = 1°-® and tolerance 
ta = 1. 
Initialize parameters to Vp, Bo 
Initialize moment variables s = 0, r = 0 and time step t = 0. 


While max | V2! st) 
UT :,Gt,n,m,8?) 


Compute s? = r ae ae Koyk (Weve — Wayr (Tt, Be, 2, m)). 
Compute g © Vr gl(T:, G:,n,m, 87) 

Update t<t+1 

Update s + pjs+(1—pi)g andr + por +(l—po)gOg 


> tol 


Correct bias in first moment s <— ip and second moment f <— 
Compute update A(T, 3) = ae 
Apply update ((y41, 8:41) < (Ts, &) + eA (T, 8) 

end While 


en 
1—ph, 


D.3 Likelihood gradient 


Applying Adam requires to compute likelihood gradient Vp.gl(T;, ;,n,m, s7). Let w € 
([', 2) be any of the parameters governing firm production or workers’ preferences. Log 


likelihood differential with respect to w is 


OUT Pot m, 8” O log pyr (T, B, n,m, 8”) ~,O log S2(T, 8,n,m, $7) 
ee oa ; Ow + D5 : Ow 


2 
Ows 4 (T,8,n,m,87) 
(Weve _ wey (PB snymss?) 


os Ow 
ee. , 
—— 2s? 
where 
O log ply (T’, B, m,™, s”) = 1 OP yx _ k OU _ OVy 
Ow 7 14+ 30 ke \ Ow * Ow Ow 
Dlog S8(I',B,n,m,s?) __ Uy 
Ow Ow 
OWayk(T, B, nm, ™, s”) -_ 1 OP yb = k OU; = OV, _ OQeyk + OU» 
Ow 14+ 30 ke \ Ow * Aw Ow Ow Ow 


7, and Pay can be computed directly given their assumed functional forms. 


My solve the following linear equations: 


Ue 
ae and 


key aU, aV,\ _ ky A®,, 
Stee ( - haa 4) = ipsa ae 


yk yk 


1 aU, OVvy\ _ 1 OD, 
igs Et (sone + = igs Bis Vy, 


k k 


which are obtained by differentiating marginal conditions (2.8). 


E Counterfactuals 


Table 2.4: Changes in Sorting in 16-34 age group - Predicted versus Counterfactuals 


ree | 1987-2017 | Labor Industry Production Worker 
change supply | composition parameters preferences 
Agr., Mining, Energy, Constr. 5.18 13 0.81 0.93 0.71 
Manufacturing 33.39 12.5 1.03 1.19 0.8 
Retail, Hospitality 12.31 10.64 0.9 0.78 0.86 
Transports, Communication 44.96 8.09 0.84 1.93 0.46 
Services 22.05 9.93 0.74 0.59 0.64 


Table 2.5: Changes in Sorting in 35-54 age group - Predicted versus Counterfactuals 


Hiduseey | 1987-2017 | Labor Industry Production Worker 
change supply | composition parameters preferences 
Agr., Mining, Energy, Constr. 4.26 11.51 0.91 0.84 0.81 
Manufacturing 27.24 8.78 1.01 Hele Og 
Retail, Hospitality 10.08 10.45 0.93 0.69 0.81 
Transports, Communication 65.55 10.82 0.91 5.89 0.52 
Services 18.73 7.82 0.81 0.83 0.59 


128 


Table 2.6: Changes in Sorting in 55-68 age group - Predicted versus Counterfactuals 


1987-2017 


Labor Industry Production Worker 
Industry en 
change supply composition parameters preferences 

Agr., Mining, Energy, Constr. 2.87 7.13 1.33 0.59 0.87 
Manufacturing 14.69 2.99 0.86 0.93 0.79 
Retail, Hospitality 4.43 6.09 0.76 0.3 0.75 
Transports, Communication 32.83 5.03 1.39 28.1 0.53 
Services Lor 2.86 1 0.65 0.58 


Table 2.7: Changes in Wage Premium in 16-34 age group - Predicted versus Counterfactuals 


1987-2017 


Labor Industry Production Worker 

Industry on 
change supply | composition parameters preferences 

Agr., Mining, Energy, Constr. -0.27 -1.04 -0.64 0.38 -20.76 
Manufacturing -0.56 -1.15 -0.33 el? -0.2 
Retail, Hospitality -0.46 -1.03 -0.67 0.14 -4.54 
Transports, Communication 0.21 -1.07 -0.52 1.18 0.14 
Services 0.71 -1.13 -0.6 0.11 4.47 


Table 2.8: Changes in Wage Premium in 35-54 age group - Predicted versus Counterfactuals 


1987-2017 


Labor Industry Production Worker 

Industry we 
change supply | composition parameters preferences 

Agr., Mining, Energy, Constr. -0.2 -0.81 -0.6 -0.24 46.31 
Manufacturing -0.15 -1.02 -0.65 -0.27 0.23 
Retail, Hospitality -0.46 -0.78 -0.69 -0.45 39.02 
Transports, Communication 0.11 -0.85 -0.61 0.61 0.18 
Services 0.81 -0.88 -0.7 -0.07 3.94 
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Table 2.9: Changes in Wage Premium in 55-68 age group - Predicted versus Counterfactuals 


ida | 1987-2017 | Labor Industry Production Worker 
change supply | composition parameters preferences 
Agr., Mining, Energy, Constr. -0.49 ime -1.9 -1.18 0.29 
Manufacturing 0.6 13.56 -4.65 -1.56 L.71 
Retail, Hospitality -0.17 5.76 -2.35 -1.29 0.99 
Transports, Communication -0.15 5.79 -1.4 -0.89 0.42 
Services 0.35 4.82 -1.72 -1.39 1.59 


F Comparison to Card & Lemieux’s model 


Katz and Murphy (1992) and Card and Lemieux (2001) have shown that the CES pro- 
duction function parameters are identified from assuming that labor is optimally supplied 
to the economy and that wages are competitive, that is assuming that in each year t a 


representative firm solves 


max 7(t) — ~ Aww — 3 Lig sn: (2.18) 


where y(t) is the CES production function described in section 4 with no dependence on firm 
type, as in this set up I assume a single representative firm. I also assume in this section 
that elasticities of substitution 7”, 7”, a, as well as age productivity parameters (Axa) a9 


(Aza), do not vary with time. Wages are competitive and equal to marginal productivity: 


tH-1 ; 1 ; 
waalt) = Aye Halt) x Ou(t) = H(t)7#* x o(t)* Va € {y,m, s}, (2.19) 

tThO-1 rl 1 1 rl “ 
What) =A, La(t) x 6,(t)= L(t) * x y(t)? Va € {y,m, 5} 

Which results in relative wage equations: 
wae | Ty —1 (<2) 1 Ga 
lo = lo — | ——_lo ‘ 
. (Ss (t) TH 6 Ata cH 6 Ay (t) (2 20) 
(t) 


1 WLa OS i 1 l Aka 1 | L(t) 
) = ) ——lo 
& WL,a! (t) TL 8 A La! 7h 6 La (t) 
Restricting (A,a),, (Az,a),’8 variation in time, and adding a stochastic shock to account 


for measurement errors in observed wage and hours worked, relative age productivity and 
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age elasticities of substitution can therefore be estimated by ordinary least squares through 


equations: 


log (wae = dita,a9 — = log ( LB) pate (2.21) 


WL att) 1 a(t) 
] —_—— = d a,ag l a,ag> 
= ee 0) ie aa (7 Oy sie: 


where do is the reference age category. Age productivities Aq, Az,q can then be retrieved 


from fixed effect diay, dL,aay USINg Normalization conditions (2.11). 


Estimates for aggregate labor inputs H(t) and L(t) can be computed from estimated 
age productivities and elasticities of substitution. First order conditions (2.19) also give an 


expression for relative wage across education levels: 


w(t) <n CECH) te (nt) 2a). 


07 (t) 
ductivities and elasticities of substitution and adding measurement error gives us equation 


log (sas) fF 1G) “log (Fe) + Va.ty (2.23) 


where I(t) is a linear function of time and f is estimated from equations (2.21). 


Assume log (28 3) follows a linear time trend. Plugging in previously estimated age pro- 


Weighted Least Square estimation of equations (2.21) and (2.23) are presented in table 
2.10 and 2.11. The weights used are the inverse sampling variance of estimated wage gaps”. 
Labor input from any given education level and age bin is computed as the total sum of 
hours workers per month in a year. Average wage premia between age and within education 
are used as outcome variable in equation (2.21) and computed yearly and by education level 
by regressing individual wages on a dummy for age, plus fixed effects for gender, industry 
and region, to control for composition effects. Average wage premia between education levels 


and within ages are computed in the same fashion. 


°In equation (2.23), I weight by the inverse of the sum of the wage gaps and i inverse sampling variance 
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Table 2.10: Estimated age productivities and elasticities of substitution - Reduced Form 


| Below High School | Above High School 

aT 15.907 15.301 

(2.216) (2.427) 

Aaj 0.332 0.331 

(0) (0.001) 

Av 0.333 0.332 
(0) (0) 

Aw 0.335 0.338 

(0) (0.001) 

Hie 0.994 0.972 
Obs. 58 58 


Estimated age elasticities of substitution 7 in Portugal from 1987 to 2017 are higher than 
estimates found by Card and Lemieux (2001) for the US, the UK and Canada from the 1970s 
to the early 1990s, which are between 4 and 6. This reflects the lesser impact of movements 
in relative age group supply on age group wage differential in Portugal than in the US, UK 
and Canada. Estimated age productivities are very similar between education levels. They 
are also balanced between age groups, which suggests no age group is much more productive 


than another. 


Table 2.11: Estimated education productivity growth and elasticity of substitution - Re- 
duced Form 


or 4.933 


(0.151) 
log 34 | 0.018 

(0.001) 
R? 0.974 
Obs. 87 


Elasticity of substitution between workers below and above high school is also higher in 
Portugal than what is found by Katz and Murphy (1992) for the US and Card and Lemieux 
(2001) for the UK and the US, who has estimates between 2 and 2.5. However Card and 
Lemieux (2001) find no significant effect of relative labor supply on relative wage between 


education levels in Canada, suggesting a very high substitutability of graduates and non 
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graduates in that country. their analysis also focuses on college versus high school grad- 
uates, which is not directly comparable to my analysis on high school graduates and non 
graduates, who appear to be more substitutable than college graduates and non graduates. 
Like Katz and Murphy (1992) and Card and Lemieux (2001), I find evidence of skill-biased 
technological change in Portugal over the period, as relative productivity between education 
groups increases by 1.6% every year. This is in the range of what Card and Lemieux (2001) 
find for the US, UK and Canada. 


This analysis informs on the large substitutability of workers between age groups and 
education levels, as well as the slow but significant high school biased technological change 
occurring in the Portuguese economy between 1987 and 2017, under simple assumptions on 
supply and demand. Its conclusion is that it is the increase in relative supply of high school 
graduates that causes the decrease in wage premium, in particular for young workers, who 
experience a more important rise in relative supply. 2.14 presents the predicted wage gaps 
by age group, against observed wage gaps in the data. Coefficients estimated with Card and 
Lemieux (2001)’s method fail to match well the wage premium evolution over the period: 
predicted wage is flatter over the period than it is the data, suggesting that imposing linearity 
in the evolution of relative education productivity and constant age productivity across time 


restricts the estimation too much. 


Figure 2.14: Predicted wage gaps between high school graduates and non-graduates of 
same age 
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CHAPTER 3 


Repeated Matching Games: an Empirical Framework 
Chapter co-authored with Jeremy Fox and Alfred Galichon 


Abstract 


How much does expectations of future returns influence agents matching decisions? 
To answer this question in the context of labor markets, where workers and firms 
may both make their employment and hiring decisions depending on future gains, 
we introduce a model of dynamic matching with transferable utility. Agents have 
individual states, or types, that evolve depending on current matches. Each period, a 
matching market with market-clearing prices takes place. We discuss a full equilibrium 
with time-varying distributions of agents types and show how to computationally solve 
for it. We introduce econometric shocks to account for unobserved heterogeneity in 
match formation and show that a stationary equilibrium exists, with and without 
econometric shocks. We propose two algorithms to compute a stationary equilibrium 
with econometric shocks, one that solves the stationary equilibrium equations with a 
non linear solver, the other that reformulates the problem as a min-max problem. We 
adapt both algorithms for estimation, and use the methods developed to estimate a 
model of geographic mobility costs for Swedish engineers. We find that mobility costs 


are impose a sizeable penalty in match production, and evolve non-linearly by age. 
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1 Introduction 


This chapter introduces a tractable model of one-to-one dynamic matching. Agents have 
individual types, such as education and experience for workers, and industry and occupation 
for jobs. When deciding with whom to match, agents account for future expected returns 
that stem from a change in type: for instance, workers expect returns from accumulating 
different types of human capital depending on which type of jobs (technical, managerial, 
etc.) they accept. In turn, this change in type will affect returns from future matches. Each 


period a matching market takes place, where wages act as market-clearing prices. 


Relationship formation is an increasingly studied topic in economics, and matching games 
are a key class of models that predict the formation of relationships. Consequently, match- 
ing games have become important empirical tools alongside the availability of datasets on 
formed relationships. More specifically, economists impose that the observed relationships 
are the solution to a matching game and then use this assumption to provide restrictions 
to base estimation on. Combined with an appropriate model of econometric error terms 
and a computationally tractable estimator, an economist can structurally estimate relevant 
parameters related to the payoffs agents have for the characteristics of potential partners in 
the matching game. However, the literature that structurally estimates matching games has 
almost exclusively, with a few important exceptions mentioned below, restricted attention 
to static matching games. In a static matching game, agents all make matches (or remain 
unmatched), and then the model ends. Static matching games capture market forces in the 
sense that agents compete to make the best matches: for instance, it is hard to get funding 
or quality venture capital managerial advice for a startup if there are many startups relative 
to funding opportunities. Likewise, high-quality car part suppliers take profits away from 
low-quality suppliers. By their nature, static matching games do not model how matches 
today affect agent state variables and hence future matches. In contrast, many datasets 
track how agents’ relationships change over time. In panel data tracking the personal lives 
of individuals, one will observe marriage, divorce, and remarriage. In labor market matched 
employer/employee data, one can track workers as they move between firms and hence track 
firms as they hire and fire workers. Tech workers in Silicon Valley switching firms may con- 
tribute to economic growth. Professional athletes often switch teams, sometimes changing 
their leagues’ competitive landscapes when they do so. In venture capital data, one can see 
the same startup firm returning for subsequent rounds of funding, perhaps with different 
sets of venture capitalists each round. Likewise, the same data show venture capital firms 


investing in different portfolio firms over time. Data on the automotive suppliers providing 
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car parts to particular car models have a time dimension, as different car models are re- 
freshed, with possibly new suppliers, each year. Key to all these empirical examples is how 
match partners affect the evolution of an agent’s state variable. In labor markets, a worker 
may gain on-the-job training and hence make better matches in the future. Entrepreneurs 
might be generalists who require experience in several roles before launching their own firms 
(Lazear (2009)). In supplier/assembler matching, lower-quality car part suppliers partici- 
pating in Toyota’s Supplier Development Program might raise the quality of future parts 
(Fox (2018)). For the funding of young startups by venture capitalists, participating in an 
accelerator program might raise the startup’s quality and the prospects of future rounds of 
funding. In personal relationships, divorce might lead to stigma on the marriage market or 


it might lead to knowledge of how to avoid relationship mistakes. 


To our knowledge, there is not a useful off-the-shelf model from the theory literature that 
generalizes the static matching games mentioned before to a dynamic setting. Of course, 
there is a large and influential literature on search models (e.g., Burdett and Mortensen 
(1998)), but we wish to instead extend static, complete information matching games to a 
dynamic setting. This proposal introduces a formulation of what we call a repeated match- 
ing game. This repeated matching game is a novel model for theoretically understanding 
the formation of matches over time. Also, the repeated matching game provides a tractable 
framework for structurally estimating parameters related to agent payoffs using datasets 
on relationships over time. Our concept of a repeated matching game extends transferable 
utility, static matching games. In this class of static matching games, agents have com- 
plete information about potential partners, and monetary transfers are exchanged between 
matched agents. The solution concept is often pairwise stability or competitive equilibrium, 
which can formally coincide for simple matching games. Static, transferable utility matching 
games have productively formed the basis for many papers that structurally estimate models 
of relationship formation (Dagsvik (2000), Choo and Siow (2006), Fox (2010b), Chiappori 
et al. (2017), Fox et al. (2018), Dupuy and Galichon (2014), Galichon and Salanié (2021)). 


Our repeated matching game operates in discrete time. Each period, there is a set of 
active agents. Each agent has a state variable, which is also the type of an agent in the 
language of static matching games. Making a match, or remaining unmatched can affect the 
evolution of this agent state variable or agent type. Each period, there is a matching market 
with prices or transfers for different matches. These prices clear the market. Given these 
prices, each agent selects the best partner in an, importantly, forward-looking manner. In 


other words, each agent picks a partner today taking into account how the relationship choice 
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affects the agent’s own state variable and hence the profitability of possibly all matches in 
future periods. Next period the matching market reopens, new prices are stated and new 
matches form. Each period should be thought of as long enough for all agents to consider 
exiting a current match and choosing a new partner. Frictions such as switching costs can 
be included if desired, for example as one explanation for sticky matches that last multiple 
periods. A repeated matching game can have both individual and aggregate dynamics. At 
the individual level, each agent is solving a single-agent dynamic programming problem, 
where each period the agent’s action is to choose a partner to match with. At the aggregate 
level, the state variable of the matching market is the active agents’ current set of types or 
state variables. This aggregate state variable evolves with the decisions of the individual 
agents. Matching distribution at the aggregate level is obtained by solving for the social 
planner’s Bellman equation. We expose two methods to compute the social planner’s value 
function and derive equilibrium matching: value function iteration, a cornerstone in solving 
dynamic optimization problems, and a method derived from the field of deep learning, which 


uses neural networks to approximate the value function. 


One of our most important theoretical results is that a stationary equilibrium exists: there 
is a set of agent state variables such that, after optimal matches are chosen by forward-looking 
agents, the same masses of agent types occur. The existence of a stationary equilibrium does 
not depend on model parameters and lets the researcher optionally ignore aggregate dynam- 


ics by imposing that the matching game is at a stationary equilibrium. 


A repeated matching game can be a useful empirical framework for structural estimation. 
We introduce a version of the repeated matching game with econometric errors representing 
unobserved heterogeneity in the preferences of agents for partner types, following Choo and 
Siow (2006). We show that a stationary equilibrium also exists in the model with econometric 
shocks. We then apply two algorithms to compute a stationary equilibrium to the model. One 
method solves a system of nonlinear equations using a nonlinear programming solver. The 
second method uses the Chambolle-Pock primal-dual algorithm. We show that both these 
methods can scale to problems with many agent types. In addition to computing a stationary 
equilibrium, we can extend the same estimators to structurally estimate parameters in the 
production of a match with an appropriate dataset. We use a dataset on Swedish engineers 
moving geographic locations to estimate switching costs by age and geographic distance. 

The repeated matching game with econometric errors can best be explained as the com- 
bination of two touchstone papers in the literature, although of course many other papers, 


including ours, are related. Choo and Siow (2006) proposes an estimator for static match- 
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ing games with logit errors. Rust (1987) proposes an estimator for single agent, dynamic 
discrete choice models, often using logit errors. In our repeated matching game, an agent’s 
discrete choice each period includes whom to match with and faces logit errors for each type 
of partner. The agent’s type in the matching game is also its state variable, as in dynamic 
discrete choice models. After computing the prices in a competitive equilibrium, our model 
of an individual agent’s behavior coincides with the dynamic discrete choice model in Rust 
(1987). In relation with the rest of the literature, We extend classic models of one period, 
one-to-one, two-sided matching with transferable utility to a setting where such a matching 
market occurs every time period (Gale (1989), Koopmans and Beckmann (1957), Becker 
(1973), Shapley and Shubik (1971)). We use econometric assumptions from the literature on 
estimating static matching games with a continuum of agents (Choo and Siow (2006), Chi- 
appori et al. (2017), Fox et al. (2018), Galichon and Salanié (2021)). Our individual agent 
problems are dynamic discrete choice models (Miller (1984), Wolpin (1984), Pakes (1986), 
Rust (1987)). In terms of dynamic matching, Choo (2015) derives closed-form formulas for 
a model where matched agents are exogenously separated from the pool of agents who can 
match. By contrast in our models’ equilibrium, agents endogenously separate based in part 
on the availability of attractive partners. Erlinger et al. (2015) and McCann et al. (2015) 
gave two-period models’ equilibrium, where in the first period an agent goes to school and 
in the second period participates in the labor market. Peski (2021) also focuses on the evo- 
lution of individual agent state variables, in his case with a dynamic search model where 
each period each unmatched agent meets another and accepts or rejects the match. Sepa- 


rations are exogenous and hence unrelated to attractive potential partners, unlike our model. 


Section 2 presents the baseline model of repeated matching games and section 3 de- 
scribes the model with econometric shocks. Section 4 presents our methods for equilibrium 


computation and section 5 our empirical application. section 6 concludes. 


2 The Model Without Econometric Errors 


2.1 Agents and Economywide Variables 


Let « € & be the state of worker, ¥ is a finite set of worker states. We also call x the 
type of the worker, recognizing it can change over time. Let y € Y be a firm state, with Y 
also finite. Both workers and firms have the option to stay unmatched, so that the choice 


set of workers is Vp and the choice set of firms is ¥9. 
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We consider an infinite horizon model, so we drop explicit time subscripts. Note that the 
horizon is the horizon for the entire economy, rather than the horizon for a worker or firm, 
which can be finite by placing worker or firm age in the state variables. We discuss below 
a finite horizon extension for the economy. The worker and firm states evolve according to 
known transititon rules that are function of the current match (x,y). First Pyyjzy = P(x'|x,y) 
is the conditional probability mass function for the worker state x if matched to a state y 
firm. Second, Qyjzy = Q(y'|x, y) is the transition rule for firm state y. For instance, 7 could 
track both general work experience (increasing if not unemployed or y 4 0) and occupational- 


specific experience (increasing for the occupation in the firm state y). 


In the aggregate economy, we keep track of the masses of workers and firms of each type. 
Let m, be the mass of workers of type x, with m = (m,)x<exv being the vector of masses for 
all worker states. Likewise, let n, be the mass of firms of type y, with n = (ny)yey. The 
aggregate state of the economy is (m,n), which contains the masses of all worker and firm 
types. Additional macro states, like demand shifters for the industry being studied, can be 
added to the aggregate state with little conceptual difficulty, although we do not pursue that 


extension. 


In a proposed outcome to the model for one period, let uz, be the masses of matches 
between workers of state x and firms of state y. Likewise muzo is the mass of workers of type 
2eXo.yeNo be the 


matrix of matches of masses. In out discussion of estimation, we will have data randomly 


x who are unmatched and jug, is the mass of vacant firms. Let 2 = (flzy) 
sampled from /1. 


Matched agents exchange monetary transfers in equilibrium. Let w,, be the monetary 
transfer or wage paid by y to x. let w = (Wey) rex yey) be the matrix of wages for a particular 
time period. In estimation, we will not use data on monetary transfers, although work on 


static matching games has explored using data on transfers. 


Given that the aggregate state of the economy is (m,n), an outcomes to the model has 
matches (m,n) and transfers w(m,n) for all possible aggregate states (m,n). The ag- 
gregate state transitions using the matches and the individual state transition rules. We 
use the shorthand notation (Pu,Q) for next period’s aggregate state. We keep the ag- 
gregate transition deterministic for simplicity, although adding stochasticity is conceptually 


straightforward in our framework. 
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2.2 Individual Agent Problems 


There is a common discount factor 6 < 1. If a worker of state x matches to a firm of 


state y at the aggregate state (m,n), the worker receives flow profit 
Any + Way(m, Nn), (3.1) 


where Qzy, is a structural parameter giving the worker flow profit before transfers. If the 
worker is unmatched, they receive a null wage and we assume Qz9 = 0. The worker is 
forward looking and chooses a partner y to maximize the expected, present discounted value 


of lifetime profit, or 


E ii (Qigtyt + Watye(m',n’)) " ; (3.2) 


t=0 
where x’ is the worker’s state variable in a future period, and y’ is the firm partner type 
picked that period. Because the individual state transitions are possibly stochastic, future 
states are random variables. The expectation is over both the future sequence of individual 
state x and the deterministic evolution of aggregate state (m,n). The worker’s problem can 


be analyzed recursively using his or her Bellman equation: 


U,(m, n) = Max {os + Way(M, n) a B Ss" Uz (Pu, QL) Pain| ) (3.3) 


EVo 
e alEX 


where U,(m, 7) is the continuation value for a worker of individual type x in an economy at 
aggregate state (m,n). The sum is over next period’s individual worker states. Symmetri- 


cally, a firm of type y has flow profit 
Yay — Way(m, ), (3.4) 


where 72, is the non-transfer portion of profit accruing directly to the firm. If the firm is 


unmatched, it pays no wage and does not produce: yo, = 0. The firm’s Bellman equation is 


V,(m,n) = max fo —Way(m,n) + BY” Vy (Pu, Qu) oun (3.5) 


yey 


where V,(m,n) is the continuation value of a firm of type y at the aggregate state (m,n). 


Each worker and firm is solving a dynamic discrete choice problem, where the discrete 


choice is a partner type. Other discrete choices, like the decision to undertake an explicit 
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investment to raise a state variable can be added to the model without changing the basic 


mathematical structure. 


2.3. Competitive Equilibrium 


Like the papers on estimating static matching games cited in the introduction, the solu- 
tion concept for our model is competitive equilibrium. A competitive equilibrium is composed 
of matches ju(m,n) and wages w(m,n) for all aggregate states (m,n) such that if zy > 0, 


then the match between worker x and firm y should maximize both agents’ profits, as in 


y € arg MaxXgey, {Org ae Wag(M, n) ate B ee Uz: (Pu, Qu) Pung } 


(3.6) 
rE arg MaAxXze x9 {Yu = Way(m, n) So 8 ies Vyi (Pu, Qu) Quiev} ’ 


m0 | 


where the individual agent’s value functions U and V are implicitly computed by value func- 
tion iteration given the competitive equilibrium. A key result from static matching games 


like Shapley and Shubik (1971) extends this to repeated matching games. 


The decentralized competitive equilibrium satisfies a social planner’s problem, due to the 
transferable utility assumption. The primal problem to maximize the present discounted 


value of economywide profit given initial aggregate state (m,n), or 

t t 
max » BSS ub (Cay + v=) (3.7) 
t=0 x,yEXo Vo 


subject to the feasibility constraints 


> Ln =, Vie and S oe =n Vind, (3.8) 


yEVo rEXo 


and the transition rules 


> Pitan = > dey Vt,x and » Ojegtiy = > ley Vt,y. (3.9) 


x'y'EX Vo yEVo a/y'EXoY LEX 


Solving the social planner’s primal computes the equilibrium matches ('); for each start- 
ing aggregate state (m,n). One can also derive the dual problem, which allows calculation 


of the equilibrium monetary transfers. The Bellman equation for the social planner’s dual 
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problem at aggregate state (m,n) is 


Sere: 0 0 
W(m,n) = os y m,U, + ) NyV, | : (3.10) 
subject to the pairwise stability constraints 


Un VG 2 (Cag ay) 
+85 US (Py, Qu) Potey +B > Vi (Pu QuQyijey Vr,ynt 


a! ECX yey 

UL >B > US (Pu, Qu)Pe2o Va2,t (3.11) 
alex 

Vy 2B S- Ve (Pp, Qu)Qy Oy Vy,t. 
yey 


Once the equilibrium present discounted value of lifetime profit U, and V, are computed 
for all worker and firm types, a set of equilibrium transfers w(m,n) for all (m,n) can be 


computed. 
The result of equivalence between the decentralized equilibrium and the social planner’s 
primal problem is given by the following. 


Theorem 2.1. The matching policy u(m,n) in a competitive equilibrium maximizes a social 


planner’s primal problem. 


Proof. In appendix A. 


The primal problem can be analyzed recursively with the social planner’s Bellman equa- 


tion 
W(m,n) = max S- Hey (Gey + Yay) + BW (Pp, an| (3.12) 
“= KayeXo Vo 
subject to the constraint 
S- fia = ie Ve: ad Lig TG VU (3.13) 
yEVo LEX 


The right side of Bellman equation (3.12) is a contraction, so a unique present discounted 


value for economy wide profit on each aggregate state (m,n) exists across all equilibria. 


Theorem 2.2. A competitive equilibrium exists and the economywide sum of future profits 


W(m,n) is uniquely determined. 
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Proof. In appendix A. 


Note that in many parametrizations, the matches ju(m,n) are also uniquely determined 


across all competitive equilibria. 


Typically, the aggregate state (m,n) varies from period to period. The time series (m,n) 
is deterministic, given a starting value for the aggregate state. The entire competitive equi- 
librium depends on the model parameters: a, y, 0, P and Q. Inspecting the above notation 
indicates that only the sums azy + yz, matter, so we can restrict attention to the flow 


production of a match, defined to be 


54 = Cig + ay (3.14) 


2.4 Stationary Equilibrium 


Define a constant aggregate state as an aggregate state (m,n) such that in a competitive 
equilibrium 
m= Pulm,n) and n= Qu(m,n). (3.15) 


In other words, a constant aggregate state (m,n) remains at the value (m,n) forever after 
that state is reached. A stationary equilibrium is then defined to be a constant aggregate 
state (m,n) and the corresponding competitive equilibrium (jz,w). The theoretical result 


for existence follows. 


Theorem 2.3. For any total mass of agents M, a stationary equilibrium exists. 


Proof. In appendix A. 
Corollary 1. For any total mass of agents M, an aggregate state exists. 


Working with time-varying aggregate state or restricting attention to a stationary equi- 
librium is a modeling decision of the researcher. By assuming that the model is at stationary 
equilibrium, firms will not be concerned about how the distribution of worker types changes 
over time. The entire focus will be on the evolution of state variables such as the experience 
levels of individual workers and firms. Other researchers may wish to study the aggregate 
dynamics of the repeated matching game. Workers and firms may indeed track the overall 
distribution of agent types. In macro, models with both heterogeneous agents and aggre- 
gate dynamics are common (Rios-Rull (1995), Krusell and Smith (1998)). In the dynamic 
games used in industrial organization, agents best respond to all other agents that track 


their own states and best respond to an analog to a constant aggregate state (Weintraub 
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et al. (2008)). As stated before, one can include additional state variables, like industrywide 


demand shifters, in the repeated matching game framework. 


Real datasets often have agents entering and leaving a matching market. For both labor 
and marriage markets, agents become adults and later either retire or die, exiting the match- 
ing market. For venture capital markets, new startups enter and may permanently exit each 
year. Modeling an exogenous process for entry and exit adds some notational complexity 
but is conceptually easy to add to the previous description of the repeated matching game. 
Note that while in some cases (say a worker needing to retire) the horizon of an agent is 


finite, the overall repeated matching game still has an infinite horizon. 


2.5 Finite Horizon for the Economy 


It is straightforward to have individual workers and firms with finite horizons. Adding 
a finite horizon for the economy changes how one solves the primal problem. The social 


planner problem in (3.7) becomes, with a finite number of periods T 


fi 
no | oa » glob} (3.16) 
ne t=0 


x,yEXo Vo 


Using computational experiments, we have found that it is best to solve this finite-horizon 
planners’ problem directly as written, rather than writing the planner’s problem recursively 
using Bellman’s equation. There is not likely to be a stationary equilibrium for a finite 


horizon problem. 


3 The Model With Econometric Errors 


The previous model will often predict that some matches occur with mass of zero, meaning 
Hazy = 0 for some types x and y. This contradicts available datasets where, with enough 
observations, it may be the case that jizy is rarely or never zero. This contradiction is solved 
by accounting for an econometric error terms in the flow profits of both workers and firms. 
This error term covers variables that matter for match formation, but are unobserved to the 
econometrician. In this section, we focus on a stationary equilibrium, for simplicity, although 
the full model with time-varying aggregate states can also be extended to have econometric 


errors. 
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3.1 Econometric Preference Shock 


Let the flow profit for a worker 7 of type x be 
Any + Way + €iy, (a7) 


where €,, is worker 7’s preference shock for type y partners. Worker 7 is indifferent between 
all partners of the same observed type y. Likewise, let the flow profit for a firm j of type y 
be 

Yay — Way + No}; (3.18) 


where 7),; is firm j’s preference for workers of type z. We make similar assumptions to Choo 
and Siow (2006) for static matching games and Rust (1987) for single-agent dynamic discrete 


choice models. 
Assumption 3.1. The econometric errors satisfy the following assumptions: 


1. For each pair of two workers i and 1’, (€iy) and (€;ry) are mutually independent 


YEVo yEVo 
in every time period and across time period. The same mutual independence condition 
holds for a pair of firms or a pair of one worker and one firm. 


2. For a single worker i in the two time periods t and t+ 1 with measured states x} and 


ait! the distribution of (Cis ee satisfies the following conditional independence, 
meaning 
t+1 t .t+1 (t = t+1 t+1 
F (Gs eds ane (4) sey, ) =i (G Jacke. |= (3.19) 


A similar conditional independence assumption holds for firms. 


In other words, knowing one’s own preferences for measured partners types is not infor- 
mation about other agents’ preferences. Under this assumption, it is irrelevant whether the 
preference shocks €;, and 7,; are public or private information to other players. Also, the as- 
sumption states that preferences are drawn anew each time period conditional on measured 
states x or y, rather than being statistically dependent over time. Relaxing conditional inde- 
pendence from Rust (1987) can be done in several ways, including allowing for time-invariant 
unobserved types or using instrumental variables, as in Berry and Compiani (2020). We do 
not formally explore weakening Assumption 3.1 in this paper. Unmeasured preferences in 
the literature on estimating static matching games with a small number of matching mar- 
kets, each with a continuum of age,ts, are typically preferences over measured partner types 
x or y rather than unmeasured preferences attributes (Choo and Siow (2006), Dupuy and 
Galichon (2014), Chiappori et al. (2017), Fox (2018), Galichon and Salanié (2021)). This 
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contrasts with a data scheme of many smaller markets, where agents could have preferences 


over unmeasured (in data) attributes of partners (Fox et al. (2018)). 


3.2 Competitive Equilibrium With Econometric Errors 


Like in the model without econometric errors (or shocks/heterogeneity), we show that 
the matching policy in the competitive equilibrium with shocks maximizes a social planner 


problem which writes as 


max 1S a ( > MoyPay — E(t sm!) s.t aa} (3.20) 
o t=0 x,yeXo Vo 


Cy 


The term € is the expectation of the econometric errors. It is referred to as the entropy 
(a term borrowed from the optimal transport literature). Its final form depends on the 
distributional assumptions on € and 7. The social planner problem is subject to the same 
feasibility constraint and transition rule as in the model without heterogeneity. However, 
feasibility constraints (3.8) need not appear explicitly: the entropy ensures they are satisfied. 
if not, the value of entropy goes to infinity. The problem also rewrites as a social planner 


Bellman equation: 


W(m,n) = ms | So pay Pey — Euan) 1s (ence (3.21) 


LL 
- ryeXo Vo 


Define expected indirect payoffs G, and HA, 


G,(u’) = 2 [imax {Uny + «}] and H,(v')=E [ana ne + | (3.22) 
and total expected indirect payoffs 


=> mG) and H(v",n*) a H,( (3.23) 


LEX yey 


Then the entropy in social planner problem (3.20) is 


E(u’, m,n) a G*(u",m') + A*(y',n’), (3.24) 
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where (G*)" and (H‘)* are the entropy of choices: 


G* (ut, m*) = | >» Peyttay — Gtunn)} 


xyeEXoy 


a (as n) a SS PeyYay — nn 


xyYEX Vo 


(3.25) 


The dual to social planner problem (3.20) writes 


min Som} Gz(u') + a Lyk (3.26) 


May Ey LEX yey 


subject to the pairwise stability constraints 


i Uny = Oy +68) Gu (u re) Prey + BY. Hy (v Oy ay Vie 
wEX yey 
ug 2 B » Gy (u‘**) Py x0 Va (3.27) 
BEX 
Udy = Pa B S- Hy ye") ) Qy Oy Vy 


y'EY 
The model with heterogeneity affords the same result as without heterogeneity: the compet- 
itive equilibrium can be found through solving the social planner problem. 


Theorem 3.1. The matching policy in a competitive equilibrium with heterogeneity maxi- 


mizes social planner problem (3.20) subject to (3.8) and (3.9). 


Proof. In appendix A. 


Writing the model with heterogeneity lets us express the competitive matching policy in 
closed form, provided we assume a distribution for the econometric errors. We follow the 


literature, and in particular Choo and Siow (2006) and Rust (1987) and assume the following 
Assumption 3.2. Econometric errors € and) are Extreme value 1 distributed. 


Under Assumption 3.2, expected indirect payoffs have logit form (Galichon and Salanié 
(2021)): 


= log >, exp(uj,) and H,(v') = log > exp(v,) (3.28) 
yEVo LEX 
and the entropy € is 
_ Hay May 
LYEX Vo 7 xyEeXo VY y 
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Proposition 1. Under Assumption 3.2 competitive matching pu‘ is 


®, ce , et Pal ai / yi No —Ut-vVi 
ty = yma ( yt B dwen ay pais y Qy'ley y 


[Lig = Me EXP (> So UE Patey — “) (3.30) 


ulEX 


Moy = Ny CXP (> S- VO as —_ “) ’ 


yey 


where (U',V*) and (U‘*1, V**") are the expected indirect payoffs in period t and t +1. 


Proof. (4)cyexo yo derives from first order conditions of social planner problem (3.20). 


3.3 Stationary Equilibrium With Gumbel Errors 


Assuming econometric errors are Extreme Value 1, as in (3.2), we are able to show that 
a stationary equilibrium exists for any total mass of agents /. The result is the same as in 
the model without heterogeneity, although the method of proof is different, as is detailled in 
appendix A. We do not explore the existence of stationary equilibria outside the logit case in 


this paper, although it may be that a more general proof exists for any distribution of € and 7. 


In the model with Gumbel econometric shocks, aggregate state (m,n) is constant and 


associated matching policy py is part of a stationary equilibrium if and only if 


m=Pu and n=Qpu 
(u, U,V,U',V’) satisfy relation (3.30) (3.31) 
Vet and VaV" 


Note that in a stationary equilibrium, indirect expected payoffs to each agent type are the 
same in every period, and equal to the Lagrange multiplier of the stationarity constraints, 


up to a constant. 


Using (??), we are able to establish the following existence result. 


Theorem 3.2. For any total mass of agents M, a stationary equilibrium and aggregate state 


exist in the model with Gumbel econometric errors. 


Proof. In appendix A 
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As in the model without heterogeneity, a stationary equilibrium and aggregate state exist 
for a given total mass of agents. In the next section, we present different methods to compute 


both the equilibrium with aggregate dynamics, and the stationary equilibrium. 


4 Methods for Equilibrium Computation 


This section develops methods for computing equilibria both in a non-stationary and 
stationary environment, given transition rules and a total mass of agents. In the non- 
stationary environment, we rely on value function iteration, making use of the social planner 
Bellman equation (3.21). We also develop a method that builds on neural networks. In the 
stationary environment, we develop two methods to compute the equilibrium: one uses the 
Mathematical Programming with Equilibrium Constraint (MPEC, see Su and Judd (2012)) 
formulation of our problem, and the other rewrites the stationary equilibrium equations as 
a min-max problem and solves it using techniques from convex optimization (Chambolle 
and Pock (2011)). We also show how to adapt these methods to estimation using data on 


matches, assuming that the data represent a stationary equilibrium. 


4.1 Equilibrium with Aggregate Dynamics 
Value Function Iteration 


Equilibrium in dynamic models is classically found using value function iteration (VFI) 
on the social planner’s Bellman equation. We expose it here in our model with econometric 
errors, but it also applies in the case without heterogeneity. Value function iteration operates 


on a grid of aggregate states ((my,7,)) _qy Where G is the chosen number of points in 


g€{l,.. 
the grid, and chooses an initial W° on each point of this grid. It then proceeds to update 


W' as follows 
W**" (nig, ig) =T (W (mg, 7g) VQ, 


3.32 
where T(W(m,n)) = max > Pay Say — E(u, ml) + BW (Pp, an| / a 


May29 
xyeEXo Vo 


Because map T is a contraction, (W‘), eventually converges to the fixed point of the social 


planner’s Bellman equation (3.21). 


Note that because (Pj, Qi) does not necessarily land on a point of grid ((mg, Ng) gef1,....0}) 


some interpolation technique is needed to compute the value of W‘ at this point: we can use 
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polynomial interpolation with either simple or Chebyshev polynomials. 


Value function iteration is a robust way of computing the equilibrium with aggregate dy- 
namics: once a fixed point W has been found, the competitive equilibrium can be computed 
on any point of the grid. Its main drawback is that it suffers from a curse of dimensionality: 
the larger G, the longer each iteration, because to update from W* to W*‘*! the optimal 
matching policy 44 must be computed on every point of the grid. Ideally G would be as 
large as possible to obtain a precise value of W. Within this ‘outer’ curse of dimension- 
ality, lies an ‘inner’ curse: the larger X and Y, the number of types of workers and firms, 


the longer each individual maximization on the grid will take, as the dimension of ju increases. 


These two issues can be alleviated in two ways: the ‘inner’ issue can be tackled using 
recent optimization solvers to solve the maximization over pz. We use the Nlopt package, 
which is available in a wide array of programming languages, and find that the maximization 
on each point of the grid is solved quickly. Besides, because each maximization problem is 
independent, the loop running through all grid points to solve them can be parallelized. Par- 
allelization partly solves the ‘outer’ curse of dimensionality. Finally, the numerical analysis 
literature (Fang and Saad (2009), Walker and Ni (2011)) provides a wide array of methods 
to accelerate fixed-point iterations. In our own value function iteration, we use the Anderson 


acceleration method. Its main idea is to use not only W‘ to update to W‘*!, but also W‘!, 


Wt, ... up to some threshold m decided by the analyst. Formally, Anderson acceleration 
writes 

wit! —wt—v'f' (3.33) 
where! 


ft =T(W') Ww! 

Via 1+ (Wit FY) ((F)" Fy" (FT 
Wi = (Wem em, wt wet) 
Fe = (frome _ ftom ft ft), 


Note the similarity of Anderson’s acceleration with a quasi Newton descent, in which the 


(3.34) 


hessian matrix is approximated by the inverse of the Jacobian of f(W) = T(W) — W. See 
Walker and Ni (2011) for an extended discussion of this point. 


TAs long as t < m, the definition for W' and F° is adapted to include all terms since the first iteration. 
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Artificial Neural Networks 


Instead of using value function iteration, which relies in a grid and is therefore subject to 
a curse of dimensionality, one can also use the deep learning and neural networks literature? 
to find a fixed point to T, where T is defined as in (3.32). This idea somewhat alleviates the 
curse of dimensionality, because it does not rely on a grid. It also performs better in terms 
of accuracy on off-grid points. It is starting to be used in economics with promising results 
(see for instance Barany and Holzheu (2021)). 

Let us draw a sample of masses of size N: ((mi,i))iep,.ny- Let ANN (mi, nj|@) be the 
neural network representation of W((m;,7;), given vector of parameters 0. Then to find a 


fixed point of 7’, one can minimize the following loss function on the sample 


min ~ (ANN (mi, n4|0) — T(ANN(mi, ni|9)))? (3.35) 


7 


Such a minimization problem is readily implemented in many libraries in Python (Pytorch, 
TensorF low) and Julia (Flux). 


Comparison 


Using Anderson acceleration on the value function iteration brings significant speed gains: 
in a model with two types on each side of the market, ‘classic’ value function iteration 
takes more than 5 hours on a 6 Cores laptop, while Anderson value function iteration takes 
only 93 minutes. In both methods, simple polynomial are used for interpolation between 
grid points and maximizations on grid points are parallelized. Comparison between value 
function iteration and deep learning is less straightforward because the methods differ on 
several levels. First the ‘procedure’ is different: VFI iterates on the grid until it has found a 
fixed point to the value function, while the deep learning method aims at finding the fixed 
point by minimizing a loss function. Second, the rely on different types of grid: VFI rests on a 
rectangular grid and a given family of polynomials for interpolation (simple, Chebyshev, etc.), 
while deep learning uses a sample of points and neural networks for interpolation. The choice 
of polynomial family for VFI and neural network for deep learning impacts both the speed 
of convergence, and the out-of-sample precision. Finally, the two methods are implemented 
using different tools: value function iteration can be accelerated using parallelization on 


the computer’s cores, while estimation of artificial neural networks can be sped up using a 


?This section owes to Julien Pascal’s excellent blog post on the topic of artificial neural networks: https: 
//julienpascal.github.io/post/ann_2/ 
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Graphical Processing Unit (GPU). In the end, the choice of method is up to the researcher, 


and should depend on the equipment available to her. 


4.2 Stationary Equilibrium with Constant Aggregate State 


We present two numerical methods to compute the constant aggregate state and asso- 
ciated stationary equilibrium in the model with econometric errors. We benchmark them 
against iterating over the social planner’s value function W once it has been found by value 
function iteration. The main benefit of these two methods over value function iteration is 
speed and tractability, as will be made explicit below. We show that these methods can be 
adjusted in order to estimate the model parameters using data on who matched with whom. 
It is easy transition from computing a stationary equilibrium to estimating the model param- 
eters while assuming that the data represent a stationary equilibrium. The minimal dataset 
comes from one market in stationary equilibrium and has cross sectional data on 2, y, x’, 1 
for matches x,y from that market, where x’,y' are the states of the two matched agents 
at the beginning of the next period. Datasets with lengthier panels can also be used. In 
estimation, we assume that the transition rules for the individual worker and firm states P 
and Q are estimated in a first stage, as often done in dynamic discrete choice models (Rust 
(1987)). We focus on a second stage in which structural parameters are estimated. We first 
use the data on matches x,y to estimate the matching probabilities fiz, as well as fiz9 and 


(toy for unemployed workers and vacant jobs, respectively. 


Mathematical Programming with Equilibrium Constraints 


Mathematical Programming with Equilibrium Constraints, or MPEC, has been used by 
Su and Judd (2012) to estimate the dynamic discrete choice model of Rust (1987), and by 
Dubé et al. (2012) to estimate the aggregate demand model of Berry et al. (1995). MPEC 
formulates model solving or estimating parameters in a model as a constrained optimization 
problem, requiring nonlinear programming to numerically solve. Here, we apply MPEC to 
the two problems of computing a stationary equilibrium and estimating structural parame- 


ters. 


To compute the stationary equilibrium, we solve the following set of equations for un- 
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knowns (U,V), (m,n): 


Ss ftay(U, V, m, n) = Ma and S° ploy (U,V am n) — Ty 


yEVo rEXo 
S- Pytjeybay(U, V, m, n) = Me! and > Qy |cyblay(U, V, 1s n) = yt (3.36) 
ryecXoy LyEX Vo 
2 $0 pay(U,V,m,n) + S> pao(U, V,m,n) + >> Hoy(U,V,m,n) = M. 
rycx VY LEX yey 


Note that these are simply the feasibility conditions, stationary transition rules, and nor- 
malization equations outlined previously. In the model with Gumbel errors, by Proposition 


1 the stationary matching policy is 


©, + / U2 Petey + pee ees (ean 3 
bay( Us Venn) = yay ex ( yt B doen ley eae yQy!ley “) 


La2o(U, V, Mm, n) = Mz EXP (: S- Ouray _ “| 


LEX 


Hoy(U, V,m,n) = ny exp (> Vy Qy \cy — “) 
y'EyV 
(3.37) 

To solve the system of equations (3.36) we use a nonlinear solver to maximize a constant 
function (say 0) subject to the constraints (3.36). We discuss in the next section our preferred 
choice of non-linear solver. 

We now turn to estimation using MPEC. The functional form of the match production is 
parametrized by ®,, = >>, APby where phy is the k*” basis function and the vector \ = (A), 
contains the coefficients on the basis function. The log likelihood for the data is 


2S) fay log fay(d,U,V,m,n)+ >> flo log pao(A,U,V,m,n)+ © floy log Hoy(A, U, V,m, n), 
ryex Y LEX yey 


(3.38) 
where again fi is the observed matching in the data. We maximize the log likelihood (3.38) 
over variables (U,V), (m,n) and A subject to the constraints (3.36). This obtains the 
maximum likelihood estimation for A, and is just a simple modification (adding an objective 


function) of the original program we have to compute the equilibrium in JuMP. 
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Reformulation as a min-max problem 


Consider the following saddle-point problem 


max min Z(U, V,U,V,m, 7, 8), (3.39) 


mn U,V 


where 


Z(U,V,U',V',m,n, 8) =- Some — So ry 


rex yeY 
+2 S © ptey(U,V,U',V',m,n, B) 
ryEeX Y 
+S 7 Heo(U, V,U',V’, m,n, B) + > _ Hoy(U, V,U", V’, m,n, B), 
LEX yey 


(3.40) 
and yu is defined as in (1) under Gumbel assumption 1 with ju’s dependence in 3 having been 


made explicit. Problem (3.39)’s first order conditions are 


AZ OZ 
A = De Heyl V,U',V', m,n, 8); au = 8 DPV 2) =, 
» yEVo ryexroy 
OZ OZ 
BV, = — S© pey(U,V,U',V',m, n, 8), av = BS) Qyteytey(U,V,U',V', m,n, B) =0, 
LEX ryEX Vo 
VA 
< = =14 Yt UV." V',m,n, B) = 0, 
Ma * yEVo 
OZ 
Fey BA teal V’,m,n, B) = 0. 
ty Y rEXo 


(3.41) 

When £ is 1, these are exactly transition rules and feasibility conditions for aggregate 
steady state. One only has to solve problem (3.39) to find the stationary equilibrium and 
aggregate steady state. There exists easily implementable methods to solve min-max problem 
such as the Chambolle-Pock algorithm (Chambolle and Pock (2011)). However because in 
general 6 < 1, we cannot straightforwardly solve (3.39), as it is not the optimization problem 


whose first order conditions give us the stationary equilibrium: in He a nd of 


a 6 appears 
that takes us away from the stationary transition rules. To tackle this ee we adapt 
the Chambolle-Pock algorithm to the case when § < 1 in order to cancel the extra 3 that 


appears in the first order conditions (3.41). To solve for (U,V), (m,n) that satisfy the 
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stationary equilibrium equations we choose step € and we do 


Att ot oa 


att! — ant — nt-1 
ue =U'—e (OuZ ae ae Ur ae net ne 8)) oF BOnZ (O% Vv‘, U', Vs, ety ne 8) 
ye =V'—e (OvZ (Os Vi, Ds ae ian a 8)) 7 B02 (U*, a Os aa mt, ae 8) 
mt _ mt zs COE (oe, ap oa ye, m, nt, ) 


nitt = nit + €O,Z er er a9) . 


(3.42) 

Note that the algorithm is similar to an alternate gradient descent, which updates al- 
ternatively the parameters on which to minimize and those on which to maximize. The 
only difference is that instead of updating minimization parameters using the actual value of 
maximization paramters, we take an average between their actual and previous value. This 
simple adaptation is what ensures convergence of the algorithm in the case where 6 = 1, 
see Chambolle and Pock (2011) for more details. Note also that total mass of agents can 
be normalized through an additional equation, which simply adds a parameters on which to 


update the algorithm. 


The min-max reformulation is easily adaptable to estimation. As for MPEC, assume the 
match production’s functional form of is ®,,(A) = }>,, Ax@%, and ji is the observed matching 


in the data. Then we can estimate (Aj;,),, using moment condition 


>, Sila= >|. Chie. Ve (3.43) 


xyEXo Vo xyEXo Vo 


To include this condition in the min-max formulation, simply write a new function Z: 


Z(U,V,U',V',m,n,B,)=—- So m.— Son, 


rex yey 
+2 S- ba (U; V, U’, Vs, m,n, , X) 
ryEexy 
Te S eo(U, V, U", V',m,n, B, A) (3.44) 
LEX 
a > fog, V, Us Vv", m,N, B, X) 
yey 
— So) Gyy(A)fiey(U, V,U", V’, m,n, B, d). 
xyeXo Vo 


Then first order conditions to the saddle-point problem 
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max min Z(U,V,U,V,m,n, B, A) (3.45) 


are the same as (3.41), plus moment conditions (3.43). One only needs to adapt the algorithm 


in (3.42) to include the moment condition to estimate X. 


Comparison 


We now proceed with some speed and precision comparisons between the two methods, 
MPEC and min-max. Speed is of course heavily dependent of the machine used, as well as 
of the solver chosen to solve the MPEC set of equations. The following benchmark are run 
using a computer with a 6-Core Intel processor, and 32GB of RAM. We define the MPEC 
optimization problem in the problem definition language JuMP (a Julia package) and solve 
the system using the nonlinear programming solver MadNLP? based on Shin et al. (2020) 
which relies on interior point methods to solve non-linear optimization problems. Note that 
much of the speed gains from MPEC rest on the use of the programming solver: for instance 
the Ipopt solver performs remarkably less well than MadNLP, although it is often used to 
solve non linear problems and relies on the same interior point methods as MadNLP. On 
the other hand, the min-max formulation and adapted Chambolle-Pock algorithm are more 
robust to implementation choices, as they can easily be coded from scratch by the researcher 


in any computing language of her choice. 


Table 3.1 and 3.2 reports both methods’ speed of convergence for equilibrium computation 
(assuming match surplus is known) and estimation depending either on the number of types 
of agents on each side of the market (for equilibrium computation) or on the number of 
surplus parameters to estimate (for estimation). Both methods are extremely fast when the 
number of types or parameters is small. They both slow down as numbers grow, although 


the min-max method remains fast. 


Table 3.1: Equilibrium computation: methods’ speed depending on the number of types 


Method | nbx = 2, nby=2 nbxr=10, nby=10 nbs = 30, nby = 30 
MPEC 0.004s 1.168s 408.770s 
MinMax 0.048s 1.805s 35.8718 


Average speed in seconds over 10 runs 


3Downloadable from Github: https: //github.com/MadNLP/MadNLP. j1 
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Table 3.2: Estimation: methods’ speed depending on the number of parameters 


Method | nbk=2 nbk=10 nbk = 30 
MPEC 3.0778 11.918s 35.910s 
MinMax | 0.751s 2.0448 9.2118 


Average speed in seconds over 10 runs 
nba = 10, nby = 10 


Table reports the largest standard error obtained with each method depending on how 
big the sample from which fi is obtained is. Both methods yields satisfactory standard errors 


when the sample size is large. 


Table 3.3: Estimation: methods’ estimation precision depending on sample size 


Method | obs = le4 obs =1e5 obs = 1e6 
MPEC 0.215 0.062 0.020 
MinMax 0.216 0.063 0.020 


Largest standard error over 10 runs 
nbz = 10, nby = 10, nbk = 10 


5 Empirical Application 


To illustrate the usefulness of our model applied to labor data, we estimate moving costs 
for Swedish engineers in the 1970s by age and geographic distance. We show costs to switch- 


ing region for work are sizeable, and non linear in age. 


The dataset used covers a subset of Swedish engineers and the firms they work for from 
1970 to 1990 (see Fox (2009), Fox (2010a) for more background on the data). Observations 
are at individual times year level. Workers and firms are observed each year with a unique 
identifier. The data contains a number of characteristics on both workers and firms, among 


which worker age and firm location. Worker and firm type are defined as follows: 
x = {age, previous location} and y = {location}, 


where the previous location for a type x workers is their employer’s location in year t — 1, 
which is assumed to become the worker’s location in year t, and location for type y firms is 


its geographic region in year t. Both ages and locations are aggregated from the data: we 
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assume 5 different ages in the model: up to 25 years old (age of 1), from 26 to 35 (age of 2), 
from 36 to 45 (age of 3), from 46 to 55 (age of 4), 56 years old and more (age fo 5). Location 
is provided in the raw data as one of Sweden’s 23 counties. We aggregate these into larger 
areas based on county location to obtain four regions: Stockholm, counties immediately next 


to Stockholm, counties in south Sweden, and counties in the center and north of the country. 


As a result, there are 20 types for workers and 4 types for firms: nbx = 20 and nby = 4. 
Following Fox (2010a), we are interested in evaluating switching costs between regions by 


age. Our match production function is 


5 
P(x, y) = > Nal rage=a] dist (Zz, y), 


c=] 


where dist(x, y) is the average distance between counties that make up the worker’s previous 
region in x and the firm’s region in y. We are looking to estimate parameters A, for each 


age bin a. 


Let (ian ' be the observed matching in year f, i.e. [es is the empirical probability of 
randomly drawing a match between a worker of type x and a firm of type y in the observed 
data, Seb tig = a iol to be the average of this probability over the decade. We 
do not observe unemployed workers nor firms with no employee and hence include no un- 


matched agents in the estimation. 


Transition matrices are estimated directly from the data, using f. Workers transition 
deterministically from their location at t — 1 to their employers’ location at t, so that 2j,. = 
Yj... We assume a probability p = .1 of ageing from age a to age a+ 1. Engineers regularly 
leave the market either to retire or work in industries that are not accounted for in the data, 
and a number of firms also leaves the market every year. Therefore we assume some attrition 
in both populations. We compute attrition rates 6, and 6, by worker and firm type. They 
are taken to be the average of the share of agent of each type who leave every year between 


1971 and 1980. Worker transition matrix P is a 20 x 80 matrix with entries 
(1 _ A) pbeay if Lage = Ts and Yloc = is 


os ~ 5 / / 
Prisy = \ Poaitey if Lage + 1 = ae atid Yiee = Line 


0 otherwise. 
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Firm type transition matrix Q is a 4 x 80 matrix with entries 


Q ; = Oyblay if Yloe = Yioe 
ey 0 otherwise. 


In order to remain on the stationary framework despite workers’ and firms’ attrition, we 
introduce an incoming flow (i,), for workers and (i,), that is computed to compensate loss 


from attrition based on observed matching: 
tg! = a platy _ S- Pei\zyblay and by! = S- flay! _ S Qy!|zyltay- (3.46) 
y vy x xy 


Accounting for attrition and incoming flows slightly modifies the stationary equilibrium 


equations that become: 


Ss Lay(U,V,m,n) =m, and S- Lal U, V7, 0) = Ny; 


yEVo LEX G 
> Pa lpyliay U, Vn Rh) = Mg iy and y Oy iegtial U,V) = ay, 
ryeXoy xyEX Vo 
2 S- ag WU, Y ™, n) zs S- Lao(U, V, mM, n) aug S- Loy (U, V, Mm, n) = M, 
ryex Y LEX yey 


(3.47) 
where jz is unchanged as in (3.37). 


To complete the estimation, we assume discount factor G = .95. Point estimates obtained 
with Chambolle-Pock and MPEC are as follows: 


Table 3.4: Estimates for moving cost by age bin 


Ay A2 A3 Ma A5 
Chambolle-Pock | -51.81 -49.86 -49.10 -47.28 -52.97 
MPEC -48.42 -48.88 -51.22 -48.31 -50.32 


Estimates are non linear in age: moving costs decrease as workers become older, except 
for 56 years old and more who face higher moving costs with respect than their younger peers. 
Because unemployed workers and vacant firms are not accounted for in this estimation, match 
production is only identified up to a constant. Thus estimates are only interpretable relative 
to each other: for instance, moving when a worker is between 26 and 35 years old imposes an 
extra penalty of about 2 on production per extra kilometer compared to when a worker is up 


to 25 years old. To interpret these results, we should also keep in mind that switching costs 
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apply to total match surplus, which accrues both to firm production and worker amenity. 
Hence switching costs can be due either to a immediate loss in match production for the firm, 
or to a dislike for moving to a different region from workers. It is likely to two combine to 
different degrees depending on age, which might help explain the non-linearities in switching 


costs by age. 


6 Conclusion 


In this chapter, we develop a model of dynamic matching with a wide range of applica- 
tions, to labor, family economics and industrial organization. The main features of the model 
are that 1) the matching game is repeated every period and 2) agents are forward-looking 
and account in changes in their state variable caused by who they match with in the present 
period. We expose the model without and with econometric shocks, and show equilibrium 
can be found in both cases by solving the social planner Bellman question. Importantly, 
we show that a stationary equilibrium, which does not change the distribution of agents’ 
state variables in every period, always exists both without and with econometric shocks. 
Then, we provide methods to compute both the dynamic and the stationary equilibrium: 
equilibrium can always be found in general by value function iteration on the social planner 
Bellman equation, or through deep learning methods that fit artificial neural networks to 
minimize a loss function that approximate the social planner Bellman equation’s fixed point. 
Then, we introduce two methods to compute the stationary equilibrium: by solving for the 
stationary equilibrium equations using a non-linear solver, or by reformulating the equations 
as a min-max optimization problem. We compare the two methods in terms of speed and 
precision, and show that the min-max reformulation is faster when the number of types is 
large, although the two methods have very similar precision. Finally, we adapt our methods 
to estimation and apply them to estimating geographic mobility costs for Swedish engineers 


in the 1970s. We find the mobility costs are substantial and non-linear in age. 
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A Proofs 


Theorem 2.1 
The central planner problem (3.7), (3.8), (3.9) has dual: 


0770 0170 
se { Sots + So nivy (3.48) 


LEX yey 


subject to stability conditions 


UL + Vi > (dey + ey) +B >) US Petey +B VE Oytey VEY 


EX yey 
t t+lp 
UL2>+B UL" Paizo Vt,x (3.49) 
EX 
Vi=tB SVM Qytoy Vtry- 
yey 


Let (u'), be a solution to primal problem (3.7), (3.8), (3.9) and (U‘, V"), be a solution to 
dual problem (3.48), (3.49). Introduce (w'), such that 


= Oey — 8 >) UE Paey > why > —Vi t+ thy +B > VE" Ouney: (3.50) 


aECX y’EY 


Then (i', w'); satisfies (3.6) and is therefore a competitive equilibrium. 


Conversely, assume (ju',w!); be a competitive equilibrium, i.e. it satisfies (3.6) for 
(U',V‘), defined through (3.3) and (3.5). Then wage (w', w'); satisfy primal feasibility 


ji AO Nise 

eye Hey = mi, Vt, x and Deas Hy * ny Vt, y 

Domed Vo Polaty! Maty! — mit Vt, xv and eAe® Qula'y’ Mey — mit Vt, Y, 
(3.51) 


dual feasibility 


Ulse Vy > ager Wa) 8 a ey Oa Paley Oey Ve Cvian, Vy 
UE 2 +f eee OF * Pet e0 Vie (3.52) 
Vi 2 +8 Nis Vir Qyioy Vt, Y, 
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and complementary slackness 


eg OLS UP a= Vix 
Moy (Vj — Briyey Vii"! Quiou) =0 Vt,y, 


(3.53) 


for the social planner problem and its dual. 


Theorem 2.2 
We show that the map y(W) > maxyem(mn) ae yy Mey (Gey + Yey) + BW (Pp, Qu)} 
is a contraction, where M(m,n) = {ey > 0 eas [leg =I, eee (eg = VE, yh. 


Fix (m,n), consider two different continuation values W, W’ and define 


ju € arg max SS) Hey (Cay + Yey) + BW (Py, an| (3.54) 
uM Cnn) ryEXo Vo 
then 
y(W) — o(W") < B(W(m, 2) — W"(m, A) (3.55) 


where (m,n) = (Pil, Qj). Hence 
pW) — pW") < Bmax (Wn, A) — WR, 7) (3.56) 


where max, is taken over (m,n) such that }7, Mz = )/,Ms and 7, Ny = D7, Ny. Because 


the above equation is true for any (m,n) we have 
le(W) — e(W")|| < BIW — Ww" (3.57) 


where ||.|| is the sup-norm. Therefore y us a contraction for the sup-norm, and a fixed point 


of @ exists and is unique. 


Theorem 2.3 
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A competitive equilibrium (sj, w) is defined through equations: 


May 20, Pro 29, Moy 20 Va,y 

Uz + Vy 2 (Gay + Yay) + 8 Dwtex Uz Pr'jay + 8 yes Vy Qyjey V,y 

Uz > BY wex Ue Patan VX 

Vy = B LHe Vy Qyjoy VY 

Hay > 0 => Uz + Vy — (Gay + Yay) — B Lae Ue! Potjay — B Lyrey Vu yay =0 Va,y 

piss PUSS CABS ee Up Pigg 0 Ve 

Loy > 0 = Vy — 8 page) Vy Qy joy =O Vy. 

(3.58) 

(1, w) is stationary iff 


SS May = S- Pyjary' Maty! Va and S- May = S- Qyla'y! Haly! Vy. (3.59) 


yEVo x'y’EX Vo LEX xly’EXoY 


Normalize M = 1 without loss of generality. The competitive ju also satisfies 


2 DF Hoy +) Heo + >) Hoy = 1. (3.60) 


xryexXyY LEX yey 


Now introduce vector 6 = (4, dy),,,, such that 6 > 0 and )7, dz + DJ, dy = at Let us 
show that there exists a yu, and (U,V) that satisfy (3.58), (3.60) and a relaxed version of 


(3.59): 


Ss" Lay — B S- Pita fiehy = Ox and S- Mey — B S- Qyla'y! Haty! = dy. (3.61) 


yEVo x'y'’EX Vo LEX O x'y’EXoY 


Equations (3.58), (3.60), (3.61) are the optimality conditions for the following linear 


programming problem: 


max ye Only 
BE" wyeX Y (3.62) 


t (3.61). 


A solution jz and associated Lagrange multipliers (U,V) exist to problem (3.62) for any 


vector 0. 


We now look for 6 such that the optimum ju(d) satisfies 


Se Pity lay 0) Oe Ve-vand. (= 28 ye Quiciyitety (6) = oy Vy. 


x'y’EX Vo x'y’EXo Y 


(3.63) 
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Then the solution p(d) to (3.62) satisfies stationarity conditions (3.59) and normalization 
(3.60). 


Let y be the map such that 9.(6) = (1 — 8) Mex yy Pole'y'He'y (6) and py(d) = 
(1 — 2) ranvexoy Qula'y Hx'y'(6). In short p(d) = (1 — B)(Pu(6), Qu(6)). We are looking 
for a fixed point of y. Let us show that y has closed graph and that y(6) is non-empty and 


convex to apply Kakutani theorem. 


The graph if y is G, = { (@, (1— B)(Py, Qu) 8.t Voyexy Prybay = ro}, where f is 
defined through 
f(b) = mes S Daylley BAB. on} (3.64) 
ey ryEex Y 
G, is closed by continuity of function f. Next, y(d) is non-empty because there al- 
ways exist a solution to (3.62). Finally y(d) is convex: let (1 — 6)(Pu(6),Qu(d)), A - 
B)(Pu'(d),Qu'()) € yd), with w # pw’. Then (1 — t)(1 — B)(Py(6), Qu(d)) + #1 — 
8)(Pu'(4), Qu'(4)) € v(9). 


By Kakutani theorem, y admits a fixed point, and there exists 6 that satisfies (3.63). 


Theorem 3.1 
We follow Gretsky et al. (1992) and Galichon and Salanié (2021). Consider the following 


problem of minimizing the sum of individual welfare under stability conditions: 


aaa a a 


s.t ui + v; > Day, + iy + Ij 
+ BEp [u'**|aiy;] + BE [vay] Vt, a,y 
ui > +e9+ GEp [u’*"|x;0] Vie 
vy > M03 + BEg [v***|0y;] Vt,y. 


(3.65) 


Take any two (u’); and (v"),; such that ui, + vi, > Pry + BEp [maxycy, {ie + ey} |ry] + 
BEg [maxzex, ae + ite Izy], and tz = “0, uyo = 0. Define 


u; =max{u,,+eéy} Vt,c and v;=max{v,+nj} Vt,y. (3.66) 
, y 


Then (uj, v5): satisfy problem (3.65)’s constraints. Reciprocally, fix any (uj, vj); that satisfy 
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the constraints in this problem and let 


Uny = min {u;—ey}, uy =O and vi, = min {vi — maj}, Uby =0. (3.67) 
Yi 


1,0;=2 


Then the constraint in problem (3.65) becomes ut, +v},, > ®2ytGEp [maxyey, {ust! + ey} |ay]+ 
BEg [maxrex, Lo + The} Jzy]. 
Applying the law of large numbers, we obtain that problem (3.65) is equivalent to 
min SS m,G,(u') + S- ny Hy (v 
as LEX yey 
8-6 Uy + Vey 2 Pay + BEp [G,(u'*") |zy] + BEg [Ay (ve) zy] Viz,y 
ug > BEp [G,(u'*")|20] Vt,x 
Uy 2 BEg [H,(v'*") Oy] Vt, y, 


which is the dual (3.26) of social planner problem (3.20). To see this, rewrite (3.26) as a 
saddle point problem, and add the term 


d, (Some Gr(u') + )+ dum Ay ( )) im S- (= merG,(u) + Sita) 


y 
Theorem 3.2 


A stationary equilibrium’s matching policy under Gumbel shocks writes 


Day + x! Ue Pet\ey + rey Vy Qy'ley — Ux — V, 
Hay UsVornsn) = a exp ( yt Bdivex jey + B Liye VyQy'iey | 


2 


Lx (U, Vim, n) = Mg EXP (> UP ei\gy _ “ 


ulEX 


Hoy (U, V,m,n) = ny exp (> > VyQy'ay — “] ’ 
yey 
(3.68) 
where the dependence of matching policy jz on expected indirect payoffs (U,V) and 
margins (m,n) has been made explicit. To show that a stationary equilibrium exists, we 


must show that there exists a tuple (U,V,m,n) that satisfies feasibility constraint: 


S- fey (U, V,m, 2) = mz. and S- [i lOve) Hay. (3.69) 


yEVo LEX go 


169 


and transition rules 


S- Pyicyfey (U, V,m,n) =m, and S- Qaiieyliey (UU, Vom, 1h) = my. (3.70) 


ryecXoy ryEX Vo 


We proceed as follows: in a first step, we show that given (m,n) the map Finn : (U,V) > 


(U,V) defined through feasibility constraints is a contraction for the sup norm, and hence 


admits a fixed point (U(n,m),V(n,m). In a second step, we show that given (U,V), the 


map G : (m,n) + (m,n) is continuous on a convex compact. Hence by Brouwer’s theorem 


it admits a fixed point (n,m). The tuple (U(n,m), V(n, m), n,m) verifies the conditions for 


the stationary equilibrium. 


Step 1: Define map Fn: (U,V) > (U,V) as Fnn = Flo F?,, with 


(ClChg) ee = + OV) oy 
(FG,V)).~= 6 CO) x (3.71) 
(FU, VN =p (QV Joy» 


where (PU),,, is short for }0,. PojeyUx". F’ is continuous in (U,V). Map F,, : ® > (U,V) 


nym 


is implicitly defined through equations 


®,,, —U, — V, 
YS vitariex ( 2 5 ) Mz exp (®z9 — Uy) = Mz 


ad (3.72) 
©. Up Vy 
S- 4/ iy EXD 5 m4 exp (Poy — Vy) = Ny. 
LEX 
six @ly 5lx@ly 
Let S = Ix Oxxy where Jy is the identity matrix of size X = #4 and 


Oy xx ly 
1x is a column vector of ones of size X. The same goes for Jy and ly of size Y = #Y. Also 
let dry = Oxy +logn, + log my, Pro = lognz, Poy = logm,, then Fy rewrites in matrix form: 


S' exp(—Sp+¢)-—q=0, (3.73) 


where p = (U,V) and q = (m,n). 
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We can apply the implicit function theorem to get 
Dgp = [S' exp (—Sp+ ¢) S| S' exp(—Sp+ ¢). (3.74) 


We now aim to show that Dgp is bounded by 1. If Dgp is bounded by 1, then p is a Lipschitz 
continuous function of @ with Lipschitz constant 1. Then F;,,,, is a Lipschitz continuous map 


for any given m,n with Lipschitz constant (3. 


To show that Dgp is bounded, take w some vector of size X x Y + X +Y with ||w|| <k 
for some k > 0. Then 


(S| exp (—Sp + 4) w),< -> /NgMy EXP Ge ac oe Ys) k+nzexp(¢z9 —Uzr)k Va 


2 
(S' exp (—Sp + 4) w) > NzMy, EXP (eo) + myexp (doy —Vy)k Vy. 
yg Lay Vey y y — Vy 


(3.75) 
Therefore 


[S" exp(—Sp+ ®) S] ~" ST exp (—Sp+ 6) w < [k,...,k]" 
a (3.76) 

Bs | [ST exp (—Sp + 6) S]~' ST exp (—Sp + 6) w| > |Iwl, 
which concludes the proof that Dp is bounded by 1 and F;,,, is a Lipschitz continuous map 


for any given m,n with Lipschitz constant 8. Since 6 < 1, Fy» is a contraction mapping 
and it admits a fixed point. 


Step 2: Let U,V be a fixed point to Fim. Then since (n,m) + Fim is continuous, 
(n,m) + (U(n,m),V(n,m)) is also continuous in n,m. Besides the latter map is defined 
on a convex compact subset of R**” since }> ny + yy My = M. 


Define map G : (n,m) — (n,m) as 


G,(n,m) = S- Pree iea (U, V9n; n) 
xyeXoy 


G,(n,m) = i O viaje, (CO, VE): 


xLYEX Vo 


(3.77) 


By continuity of (n,m) > (U(n, m),V(n, m)), (n,m) > p(U,V,m,n) is continuous, 
hence G is continuous, and by Brouwer’s theorem it admits a fixed point. 
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Conclusion 


This dissertation aims at understanding returns to education and experience on labor 
markets, and understand how they are impacted by supply and demand changes. The first 
chapter evidences flattening returns to experience for high educated workers in France: av- 
erage yearly entry wage growth on the medium-term is 11.9% for the 1998 cohort of new 
labor market entrants, and only 8.6% for the 2010 cohort. It also evidences the high hetero- 
geneity of this flattening across occupations: some have benefitted from increased returns 
to an additional year of experience, while other have suffered a decline. More specifically, 
the occupations whose total share of the working population has increased are the ones that 
experience the most severe decline. This finding suggests an over-supply of workers leads 
to slower wage progression. I explore to mechanisms that could cause this effect: access to 
managerial positions and skill-occupation mismatch. I find promotions to managerial posi- 
tions are increasingly uncommon among the 2010 cohort, compared to the 1998 cohort. I 
also find that although initial mismatch has stayed the same between 1998 and 2010, its 
weight in determining medium-term wage levels has grown between the two cohorts. The 
second chapter focuses on the high school wage premium in Portugal. It starts with the ob- 
servation that the high school wage premium has decreased since the end of the 1980s, but 
to 50% for young workers. Two opposite interpretations could be given for this fact: trade 
or skill-biased technological change overtaken by an education expansion. I find that high 
school educated workers productivity has increased in all sectors of the Portuguese econ- 
omy between the 1980s and today, and that the contraction in high school wage premium is 
driven by the rise in the relative numbers of high school educated workers. Finally the third 
chapter explores dynamic matching games, and shows that under reasonable assumptions, 
the equilibrium can be characterized and computed through various methods. This chapter 
provides a backbone for further empirical investigations in the dynamics of labor markets. 

I plan on making this dissertation the basis for a broader research agenda on returns to 
education, experience, and wage inequality. My goal is to keep expanding on the tools from 
the matching literature to analyze labor markets through the lens of supply and demand 


equilibrium. 
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There are various issues I would like to explore next. First is the gender wage gap. A 
growing literature (Morchio and Moser (2019)) highlight the role of gender’s differentiated 
valuation for amenities offered by firms to explain the wage discrepancy between men and 
women. An important question I would like to investigate is amenity valuation interacts 
with education, experience, and firm production, and how much it can account for the 
gender pay gap. Another interesting question is the role of labor market institutions in wage 
inequality: how are supply and demand equilibrium affected by the minimum wage level, 
severance policies or union wage bargaining? Third, I plan on exploring the underlying role 
of unobserved worker and firm productivity in the impact of education expansions on wage 
levels. On the worker side, productivity varies depending on latent ability (Abowd et al. 
(1999), Bonhomme et al. (2019)), which might in turn affect returns to education. If they 
are aware of their ability, workers will sort into education depending on its return. This 
is likely to change the distribution of underlying ability among uneducated and educated 
workers in times of education expansion, which in turns impacts wage levels. On the firm 
side, there may be both individual firm and co-worker effects on productivity. Accounting 
for unobserved heterogeneity is therefore key to understand why observed wage distributions 


evolve over time. 
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Résumé 


Les marchés du travail sont un élément incontournable des sociétés occidentales du 21° 
siecle. Chaque individu en Europe et aux Etats-Unis a, A un moment donné de sa vie, une 
expérience sur ces marchés, soit en cherchant un emploi, soit en étant employé. Pas un 
jour ne passe sans que les médias et les gouvernements examinent et disseéquent le taux de 
chomage et les niveaux de salaire, considérés comme des mesures capitales de la santé de 
Véconomie. Dans le domaine universitaire, une grande partie de la recherche économique 
est également consacrée aux marchés du travail. Les questions relatives a impact du com- 
merce international et de la mondialisation, au niveau d’éducation de la main-d’ceuvre, a la 
détermination des salaires et au role des institutions (entre autres) offrent des espaces de 


recherche dynamiques. 


Cette thése vise 4 comprendre les marchés du travail d’un point de vue microéconomique : 
elle modélise les choix des individus et des entreprises sur le marché de ’emploi par une 
maximisation de l’utilité (pour les individus) ou des profits (pour les entreprises), ce qui, au 
niveau agrégé, se traduit par une offre et une demande de main-d’ceuvre. Elle utilise ensuite 
ce cadre pour décomposer les facteurs de détermination des salaires. Cette thése s’inscrit 
donc dans la tradition néoclassique, dans laquelle les salaires sont des quantités d’équilibre 
déterminées par l’offre et la demande. Cependant, elle s’écarte de la théorie de base telle 
que présentée par Hicks (1932) : en tenant compte des idiosyncrasies individuelles, en in- 
troduisant des individus et des entreprises hétérogénes et en reconnaissant une concurrence 
imparfaite sur les marchés du travail, on tient compte ici d’un cadre plus riche que la vision 
standard qui postule que les salaires sont simplement égaux a la productivité marginale des 


employés. 


Dans cette these, j’ai choisi de me concentrer sur un aspect crucial des marchés de 
Vemploi, a savoir les rendements salariaux de l’éducation et de l’expérience. On observe 
généralement que les individus diplomés du secondaire ou de l’université touchent un salaire 


plus élevé que les autres. Les individus ayant plus d’années d’expérience sur le marché du 
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travail sont aussi souvent mieux payés que leurs pairs moins expérimentés. Les rendements 
de l’éducation et de l’expérience se peuvent se combiner, en faveur des individus diplomés 
et expérimentés. Cependant, les écarts salariaux entre les travailleurs diplomés ou non (ou 
moins diplomés) et les travailleurs expérimentés et inexpérimentés varient dans le temps 
et dans l’espace. Comprendre pourquoi ces écarts existent en premier lieu et pourquoi ils 
different selon la décennie ou le pays est essentiel pour deux raisons. D’abord parce qu’ils 
sont des vecteurs importants d’inégalités : en 2016, les salaires représentaient 55 % du revenu 
total des ménages en France, et 72 % aux Etats-Unis (Rani and Furrer (2016)). Puisque les 
salaires sont la principale source de revenu des ménages dans la plupart des pays, l’inégalité 
des salaires entre les individus se traduit par une inégalité des revenus. ' (Autor (2014)). La 
deuxieme raison pour laquelle il est nécessaire de comprendre les écarts salariaux est qu’ils 
nous informent sur le fonctionnement des entreprises. En effet, la rémunération que les en- 
treprises acceptent de verser a leurs employés dépend de leurs besoins : toutes choses égales 
par ailleurs, une entreprise est préte a rémunérer beaucoup plus un travailleur qui possede 
un ensemble de compétences essentielles 4 son fonctionnement qu’un travailleur qui ne les 
posséde pas. L’augmentation de la prime a4 l’éducation (c’est-a-dire l’écart salarial moyen 
entre les travailleurs diplomés de l’université et les autres) entre la fin des années 1970 et le 
début des années 2000 aux Etats-Unis, au Royaume-Uni et au Canada (Krueger et al. (2010)), 
est interprétée de maniére convaincante par une vaste littérature comme la conséquence du 
skill-biased technological change, selon le terme anglophone, c’est-a-dire un changement tech- 
nologique en faveur des plus diplomés. Ainsi, la prime a l’éducation augmenterait en raison 
d’un changement dans la structure de production des entreprises qui accroit la productivité 
des individus dipl6més par rapport aux non-diplomés (Katz and Murphy (1992), Acemoglu 
(1998), Goldin and Katz (2008), Autor et al. (2020)). Récemment, la tendance de la prime & 
Véducation s’est aplatie, et elle a méme diminué dans certains pays européens (par exemple 
en Allemagne, en Italie, en Espagne). Ce changement est révélateur de vastes transforma- 
tions sur les marchés du travail ot il s’est produit, et pourrait étre dt a un effet de demande, 
a travers un retournement du changement technologique, soit 4 une modification de l’offre : 
tant |’Europe que les Etats-Unis ont connu d’importantes expansions éducatives au cours des 
50 derniéres années. Par conséquent, méme si la demande des entreprises pour les employés 
diplomés a persisté, leur plus grand nombre pourrait étre a l’origine de la baisse du salaire 
relatif. Comprendre comment |’offre et la demande interviennent dans la détermination des 


écarts salariaux est capital dans l’élaboration des politiques économiques et devrait faconner 


'C’est particulitrement vrai dans le premier 90° centile de la distribution des revenus : les revenus du 
capital sont concentrés au sommet de la distribution des revenus et constituent le principal moteur de 
Vinégalité entre les 90% les plus pauvres et les 10% les plus riches (Krueger et al. (2010)). 
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les politiques d’éducation et de production ?”. 


Distinguer les effets de l’offre et de la demande sur les niveaux de salaire est au coeur de 
cette these. Pour ce faire, elle développe un ensemble d’outils de modélisation des données 
qui empruntent a diverses branches de la littérature économique. Dans le premier chapitre, 
j’utilise des méthodes empiriques standards (la régression par moindres carrés ordinaires) 
pour évaluer impact moyen d’une ou plusieurs variables explicatives sur une variable de 
résultat. Dans le deuxiéme et troisiéme chapitre, je m’appuie sur la littérature en économétrie 
structurelle pour construire un modéle d’appariement entre les individus et les entreprises sur 
le marché du travail. Le modéle peut étre soit statique, comme dans le deuxieme chapitre, 
soit dynamique, comme dans le troisieme. Dans les deux cas, il intégre les individus qui 
maximisent leur utilité et les entreprises qui maximisent leur profit, les salaires agissant 
comme des transferts d’équilibre. Il est important de noter que les agents du modeéle sont 
hétérogenes dans de multiples dimensions : éducation, age ou profession pour les individus, 
secteur ou emplacement pour les entreprises. Comme les individus percoivent des niveaux 
d’utilité (hors salaire) différents en fonction de l’entreprise et que tous ne sont pas également 
productifs en fonction de leur employeur, le modéle géneére de riches distributions de salaires 
et d’appariement. Enfin, j’exploite des données d’appariement entre employés et employeurs 
qui contiennent des informations précises sur les profils des individus et entreprises, ainsi que 
sur les salaires versés. A l’aide de ces données, je suis en mesure d’estimer structurellement 
les modéles susmentionnés en utilisant l’appariement et les salaires observés, afin d’estimer 
les parametres gouvernant l’utilité des individus et le profit des entreprises sur le marché de 


Vemploi. 


Le premier chapitre explore empiriquement les mécanismes de carriére qui se traduisent 
par des progressions salariales différentes par cohorte, le deuxieme chapitre utilise des méthodes 
structurelles, a savoir les modeéles d’appariement, particulicérement adaptés a l’étude des 
effets d’offre et de demande au niveau agrégé, pour évaluer l’effet simultané d’une ex- 
pansion de l’éducation et d’un changement technologique sur les salaires. Le troisieme 
chapitre développe un modeéle d’appariement dynamique pour mesurer les attentes des agents 
sur les rendements futurs de leur appariement présent. Un bref résumé de chacun des 


chapitres de la thése suit. 


?Par exemple, le programme du lycée en France a été profondément réformé en 2019, entrainant une 
spécialisation renforcée des éléves. Il reste a évaluer si cela constitue un atout pour les nouveaux diplomés 
sur le marché du travail. 
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Chapitre 1. Le premier chapitre documente l’aplatissement des rendements de l’expérience 
pour les diplomés du supérieur en France entre 1998 et 2017. Je compare |’évolution 
des salaires 4 sept ans entre trois cohortes, ou générations, sorties du systeme scolaire ou 
supérieur en 1998, 2004 et 2010. Je documente la croissance moyenne des salaires par co- 
horte et par niveau d’éducation, et constate que pour les individus peu diplomés (n’ayant 
pas terminé le lycée ou diplomés du baccalauréat) le profil d’évolution du salaire moyen ne 
varie pas entre les générations. A inverse, les diplémés de l’enseignement supérieur profi- 
tent d’une croissance de salaire moyen plus forte en début de carriére, mais cette croissance 
s’aplatit entre la cohorte de 1998 et la cohorte de 2010. 

L’économie et le systeme éducatif francais ont connu des changements importants au 
cours de la période 1998-2017. Les cohortes 1998, 2004 et 2010 entrent donc sur le marché 
du travail dans des conditions sensiblement différentes : la cohorte 1998 est confrontée a un 
chomage élevé (supérieur 4 10%), mais compte relativement peu de diplomés du supérieur. 
La cohorte 2004 bénéficie d’un faible taux de chomage et d’une forte demande des entreprises, 
mais compte davantage de diplomés de l’enseignement supérieur. Enfin, la cohorte de 2010 
entre sur le marché du travail au milieu de la Grande Récession, et fait face a un fort taux 
de chomage et a une faible croissance. Parce que l’expansion de l'éducation francaise est 
encore forte dans les années 2000, encouragée par la création de licences professionnelles 
et la mise en ceuvre du processus de Bologne, la cohorte 2010 compte nettement plus de 
diplomés du supérieur que ses prédécesseurs. Cela risque de nuire a leurs perspectives sur le 
marché du travail a plusieurs niveaux : premiérement, les jeunes de la génération 2010 sont 
confrontés a une baisse de la demande des entreprises. Ensuite, les diplomés du supérieur 
sont plus nombreux qu’auparavant. L’impact de ce second élément peut étre pensé sous 
plusieurs angles : (Gaini et al. (2013) ; Dupray and Moullet (2010)). Premiérement, si 
le diplome est un signal de productivité, ’augmentation du nombre de diplomés implique 
alors une diminution de leur productivité individuelle moyenne, qui peut se traduire par une 
progression plus lente des salaires. Une seconde approche considére le diplobme comme un 
moyen d’acquérir du capital humain. Les rendements de |’éducation repose alors sur cette 
acquisition. La diversification du systeme d’enseignement supérieur frangais, en modifiant le 
contenu des diplomes, a pu avoir un impact négatif sur l’acquisition du capital humain des 
jeunes diplomés. Enfin, une troisieme approche basée sur le modéle néoclassique standard 
prédit une baisse du salaire des jeunes diplomés si leur nombre augmente simplement parce 
que le salaire est égal au produit marginal d’un travailleur supplémentaire. Si les entreprises 
produisent avec des rendements d’échelle décroissant, chaque travailleur supplémentaire fait 
diminuer le salaire moyen. 


Ce chapitre s’attache donc a étudier empiriquement les raisons de la croissance différenciée 
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des salaires moyens en France depuis la fin des années 1990. Pour cela, j’utilise les enquétes 
‘Générations’ mises a disposition par le CEREQ (Centre d’Etudes et de Recherche sur les 
Qualifications). Les enquétes sont présentées sous forme de données de panel et couvrent la 
vie active des sortants du systéme scolaire ou supérieur en 1998, 2004 et 2010 pendant sept 
ans, et fournissent une vision globale de l’insertion des jeunes sur le marché du travail frangais. 
Je décompose d’abord les différences de croissance des salaires moyens par catégorie sociopro- 
fessionnelle (PCS) en une marge extensive et une marge intensive. La marge extensive résulte 
d’un effet de composition di a une évolution de la part représentée par chaque PCS entre 
les cohortes. La marge intensive repose sur la variation de la croissance annuelle des salaires 
par PCS entre cohortes. Les PCS qui affichent une marge intensive négative sont également 
celles pour lesquelles la marge extensive est la plus grande. En effet, les PCS qui accusent le 
ralentissement le plus important de la progression des salaires sont également celles qui con- 
naissent le plus grand afflux de diplomés entre 1998 et 2010. Cette observation est conforme 
a une interprétation du ralentissement de la croissance des salaires en termes d’offre et de 
demande, selon laquelle une offre excédentaire de nouveaux diplomés les empéche d’atteindre 
les niveaux de salaire de leurs prédécesseurs. Une telle interprétation suggere d’explorer les 
mécanismes par lesquels une augmentation de l’offre de diplomés, conjuguée a une stagnation 
de la demande, influe sur la dynamique des salaires en début de carriere. Ce chapitre en 
explore deux : la promotion a des postes de d’encadrement d’une équipe, ou de manager, 
et Vinadéquation entre spécialité du diplome et PCS. Les enquétes ‘Générations’ montrent 
que l’obtention d’un poste de manager s’accompagne d’une augmentation de salaire 4 moyen 
terme. Ainsi, une diminution de la probabilité d’obtenir un tel poste ralentit la progression 
globale des salaires. Ceci est cohérent avec les conclusions de Kwon et al. (2010). J’examine 
ensuite l’argument de Liu et al. (2016), qui montrent qu’aux Etats-Unis, les diplémés de 
Vuniversité ont souffert d’une adéquation diplome-industrie dégradée pendant la Grande 
Récession, ce qui a conduit a des niveaux de salaire constamment inférieurs a ceux de leurs 
pairs plus agés. En France, je n’observe pas d’aggravation de l’inadéquation (définie comme 
le niveau de salaire moyen en premiere année d’une spécialité de diplome donnée au sein 
d’une PCS) entre 1998 et 2010 pour les diplomés du supérieur, mais je constate que son 
importance dans la détermination des salaires futurs s’est accrue entre Générations 1998 et 
2010. 


Chapitre 2. Entre les années 1970 et aujourd’hui, le nombre d’individu diplomés a fortement 
augmenté dans de nombreux pays. En conséquence, le rapport entre effectifs des individus 
diplomés et les effectifs des non-diplomées a augmenté sur le marché du travail. Ce chapitre 


cherche & comprendre comment cette augmentation a impacté l’appariement des individus 
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et des entreprises, c’est-a-dire quel type d’entreprise embauche quel type d’individu, ainsi 
que son impact sur les salaires, en utilisant un nouveau modeéle d’appariement sur le marché 
du travail. Le modeéle est structurellement estimé sur des données portugaises appariées 
employeurs-employés. Ce faisant, je suis en mesure de quantifier l’impact des changements 
de l’offre et de la demande sur l’allocation travailleur-entreprise et la structure des salaires. 
Pour capturer les mécanismes d’offre (des individus sur le marché de |’emploi) et de de- 
mande (des entreprises) sur le marché du travail, ce chapitre construit un modéle d’appariement 
statique dans lequel une entreprise s’apparie avec plusieurs employés, et leur transfére un 
salaire (modéle avec utilité transférable). Les individus et les entreprises different de par 
leurs caractéristiques observées, qui sont résumées par un type multidimensionnel, ainsi que 
par un choc stochastique qui rend compte de l’hétérogénéité non observée. Une seule en- 
treprise embauche plusieurs travailleurs, qui forment sa main-d’ceuvre. Le surplus créé par 
Vappariement dépend des caractéristiques observables des entreprises ainsi que de celles de la 
main-d’ceuvre. L’utilité est transférable sous la forme de salaires versés par l’entreprise a ses 
employés. Les entreprises cherchent & maximiser le profit total, qui est constitué de la pro- 
duction de la main-d’ceuvre plus un choc aléatoire, moins les salaires versés. Les travailleurs 
maximisent leur utilité, qui se compose du salaire, plus une part non monétaire de préférence 
pour le type de l’entreprise et un choc aléatoire. A V’équilibre, les salaires équilibrent le 
marché et chaque agent s’apparie a sa meilleure option compte tenu des salaires. Le modéle 
peut générer une riche distribution des salaires qui dépend a la fois des caractéristiques ob- 
servables des individus et de l’entreprise, ainsi que de la main-d’ceuvre employée. I] prédit 
également un appariement d’équilibre, qui est la distribution conjointe des entreprises et 
de la main-d’ceuvre. En utilisant a la fois l’appariement et les salaires, le modele identifie 
séparément la production de l’entreprise et lutilité non monétaire percue par les individus. 
Le cadre développé dans ce chapitre offre plus de flexibilité dans l’estimation que les 
modeéles classiques d’offre et de demande de Katz and Murphy (1992) et Card and Lemieux 
(2001) : il identifie les préférences des individus en plus de la production de l’entreprise, et 
permet aux paramétres de varier dans le temps. En effet, en modélisant explicitement les 
choix d’appariement des entreprises et des individus, on peut utiliser a la fois l’appariement 
observé et les salaires observés, ce qui augmente les possibilités d’identification. Le modéle 
est estimé sur les données en supposant des fonctions paramétriques pour la production 
de l’entreprise et Vutilité non monétaire des individus. Les individus sont classés en deux 
niveaux d’éducation, les diplomés du secondaire et les non-diplomés, et en trois groupes 
d’age, jeunes (16-34 ans), moyen (35-54 ans) et seniors (plus de 55 ans). Les entreprises se 
différencient par leur secteur d’activité. Conformément 4 la littérature, la production est une 


fonction imbriquée d’élasticité constante de substitution (CES), avec des paramétres de pro- 
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ductivité pour chaque niveau d’éducation qui varient d’un secteur a l’autre. Je suppose que 
les préférences des individus pour les entreprises dépendent de l’age, du niveau d’éducation 
et du secteur de l’entreprise. En utilisant des prédictions du modéle pour l’appariement et 
les salaires, on l’estime structurellement sur des données appariées employeur-employé par 
maximum de vraisemblance sur la distribution conjointe de l’appariement et des salaires, 
séparément tous les trois ans. 

Le modele développé dans ce chapitre est lié a la fois aux problemes d’affectation de 
plusieurs biens a un agent étudiés dans la littérature de ‘market design’, ou conception de 
marché, (Bikhchandani and Ostroy (2002), Vohra (2011)), et aux modéles d’appariement 
utilisés en économie de la famille (Choo and Siow (2006)). Le modéle développé dans ce 
chapitre comble l’écart entre ces deux littératures : il étend les affectations unilatérales a 
Vappariement bilatéral et généralise l’appariement entre deux agents a un appariement entre 
plus de deux agents. De plus, le cadre économétrique de Choo and Siow (2006) et Galichon 
and Salanié (2021) est étendu aux modéles d’appariement multiple. 

Ce chapitre utilise le nouveau cadre théorique développé pour étudier le marché du travail 
portugais entre 1987 et 2017. Trois faits sont mis en évidence : premiérement, le pays opére 
une vaste expansion de sa population éduquée au cours de la période, ce qui se traduit par 
une augmentation spectaculaire de l’offre de diplomés du secondaire par rapport aux non- 
diplomés sur le marché du travail. Deuxiemement, le rendement de |’éducation diminue au 
cours de la période. Le rendement de l’éducation est défini comme |’écart salarial moyen entre 
les individus ayant obtenu leur diplome d’études secondaires et ceux ne l’ayant pas obtenu. 
La diminution du rendement de l’éducation est particuliérement marquée chez les jeunes. 
Deux interprétations s’opposent pour expliquer ce fait. Premierement, la diminution des 
rendements de ’éducation pourrait étre la conséquence d’un effet commercial : le Portugal 
a rejoint Union européenne en 1986, et parce que le pays compte relativement plus de 
travailleurs non éduquées (qui n’ont pas fréquenté l’école secondaire) dans sa population 
active que les autres pays de l’UE, un modeéle a la Heckscher-Ohlin prédit une augmentation 
des exportations de biens dont la production nécessite une main-d’ceuvre non éduquée. La 
demande relative de main-d’ceuvre non diplomée par rapport a la main-d’ceuvre diplomée 
augmente et les rendements de |’éducation diminuent. La seconde interprétation repose sur 
un effet d’offre : méme si la demande relative de main-d’ceuvre diplo6mée par rapport a la 
main-d’ceuvre non diplomée augmente, la formidable expansion de la population diplomée 
qui s’est produite au Portugal dans les années 1990 et 2000 pourrait diminuer la productivité 
marginale des travailleurs diplomés et entrainer une baisse des rendements de |’éducation. Le 
modele décrit ci-dessus est en mesure de distinguer les deux interprétations possibles. Enfin, 


on observe que la répartition des diplomés du secondaire par rapport aux non-diplomés 
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entre les secteurs d’activité des entreprises devient de plus en plus déséquilibrée, en faveur 
des secteurs de services, transports et communications, qui emploient une part croissante de 
diplomés du secondaire. Les deux premiers faits impliquent que l’offre relative de diplomés 
du secondaire par rapport aux non-diplomés a augmenté plus rapidement que la demande 
relative des entreprises pour les diplomés du secondaire par rapport aux non-diplomés. Ce 
dernier suggere que l’appariement entre salariés et entreprises a évolué sur la période : soit 
parce que les entreprises des services, transports et communications nécessitent une part 
croissante de diplomés du secondaire, soit parce que la préférence des diplomés du secondaire 
pour ces entreprises se renforce. 

Apres estimation du modeéle d’appariement, je constate que la demande relative de 
diplomés du secondaire dans les entreprises des secteurs des services, de l’industrie man- 
ufacturiére et des transports et communications a considérablement augmenté au cours de 
la période, en particulier depuis le début des années 2010. Ce résultat est conforme a 
Vhypothése de changement technologique biaisé en faveur des plus diplomées, plutot qu’a 
un effet du commerce avec le reste de ’UE : il suggére que ’augmentation de la demande 
relative de main-d’ceuvre diplomée par rapport a la main-d’ceuvre non diplomée, favorisée 
par le changement technologique, est contrebalancée par l’augmentation de l’offre relative. 
Je constate également que la préférence des diplomés du secondaire de moins de 55 ans 
pour les secteurs de service, d’industrie manufacturiére et de transports et communication 
a diminué, tandis que la part des moins de 55 ans dans la production des entreprises aug- 
mente par rapport aux salariés plus agés. En plus du cadre classique de l’offre et de la 
demande, le modéle offre deux mécanismes supplémentaires d’évolution des rendements de 
Vexpérience. Premié¢rement, une baisse de l’utilité non monétaire des individus a pour ef- 
fet d’augmenter les salaires. Deuxiémement, la demande relative des entreprises pour les 
différentes classes d’Age varie dans le temps. J’effectue plusieurs exercices contrefactuels 
pour évaluer les actions distinctes des changements dans la démographie des travailleurs (a 
la fois dans l’éducation et la répartition par age), la composition par secteur des entreprises, 
les parametres de production et Vutilité non monétaire des individus, sur l’appariement et 
les rendements de l’expérience. Il apparait que les changements démographiques sont le 
principal moteur des changements d’appariement. Les changements dans la composition 
de l’industrie, la demande des entreprises et l’utilité non monétaire des individus ont un 
effet modeste. Les rendements de l’éducation par tranche d’age et par branche d’activité 
sont affectés négativement par les changements dans la démographie des travailleurs et la 
composition des secteurs d’activité (c’est-a-dire l’augmentation de la part des services) et 
positivement par les variations de la demande des entreprises. Ainsi les changements de 


productivité relative en faveur des diplomés du secondaire ont fait augmenter les rendements 
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de l’éducation, mais ne peuvent pas compenser la forte augmentation de offre relative de 


diplomés par rapport aux non-diplomés. 


Chapitre 3. Co-écrit avec Jeremy Fox et Alfred Galichon. Ce chapitre adopte une per- 
spective différente des deux premiers : au lieu d’étudier l’appariement des individus et des 
entreprises dans un monde statique, il explore comment des considérations dynamiques in- 
fluencent l’appariement. En effet, sur de nombreux marchés d’appariement, y compris le 
marché du travail mais aussi le marché matrimonial, les agents tiennent compte du fait que 
leur type peut évoluer dans le temps, soit de maniére déterministe (par exemple, l’age des 
individus), soit en fonction de l’agent ou la personne avec laquelle ils se sont appariés (si un 
employé travaille dans une profession donnée, il accumulera du capital humain dans cette 
profession). Pour comprendre comment ces considérations influencent les choix des parte- 
naires ou de l’employeur/employé, nous développons un modeéle d’appariement dynamique. 
Les agents ont des types individuels, tels que l'éducation et l’expérience pour les salariés, 
et Vindustrie et la profession pour les emplois. Lorsqu’ils décident avec qui s’apparier, les 
agents tiennent compte des rendements futurs attendus qui découlent d’un changement de 
type. A son tour, ce changement de type affectera les retours des appariements futurs. A 
chaque période, le marché s’équilibre via les salaires. 

L’objectif de ce chapitre est de développer un modeéle prét a l’emploi utile de jeux 
d’appariement répétés qui généralise les jeux d’appariement statiques a un cadre dynamique. 
Il differe également des deux chapitres précédents parce que le modéle développé n’est 
pas seulement applicable aux questions de travail, mais pourrait également s’appliquer en 
économie de la famille ou a l’organisation industrielle. Nous introduisons également des 
chocs stochastiques, ou erreurs économétriques, pour prendre en compte Vhétérogénéité in- 
observée dans les données. Le jeu d’appariement répété avec des erreurs économétriques 
s’apparente a une combinaison de deux articles de référence dans la littérature : Choo and 
Siow (2006) proposent un estimateur pour les jeux d’appariement statiques avec des er- 
reurs logistiques, et Rust (1987) propose un estimateur pour les modéles de choix discrets 
dynamiques a agent unique, utilisant aussi des erreurs logistiques. Dans ce chapitre, nous 
combinons les deux pour obtenir un estimateur pour les jeux d’appariement dynamiques. 

Dans notre cadre d’appariement répété, chaque agent a une variable d’état, ou type. 
S’apparier, ou rester sans partenaire, peut affecter l’évolution de cette variable d’état. A 
chaque période, les agents participent 4 un marché d’appariement avec des prix ou des 
transferts pour différents appariements. Compte tenu des prix d’équilibre du marché, chaque 
agent sélectionne le meilleur partenaire en prenant en compte son utilité future en fonction 


d’éventuel changement de sa variable d’état. La période suivante, le marché correspon- 
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dant rouvre, et de nouvelles correspondances se forment selon les nouveaux prix. Un jeu 
d’appariement répété peut avoir a la fois une dynamique individuelle et agrégée. Au niveau 
individuel, chaque agent résout un probleme de programmation dynamique a agent unique, 
ou chaque période, l’action de l’agent consiste a choisir un partenaire avec qui s’apparier. 
Au niveau agrégé, la variable d’état du marché correspondant est l’ensemble actuel de types 
d’agents actifs, ou variables d’état. Cette variable d’état agrégée évolue avec les décisions des 
agents individuels et se résume par une équation de Bellman au niveau de l’économie, résolue 
par un planificateur social. Nous développons d’abord le modéle sans erreur économétrique, 
puis en tenant compte des préférences individuelles sous forme de choc économétrique. Dans 
les deux cas, nous explorons deux méthodes différentes pour calculer |’équilibre agrégé : une 
méthode repose sur |’itération de la fonction de valeur du planificateur social sur une grille 
pour calculer sa valeur, et l’équilibre associé sur chaque point de la grille, et la seconde 
méthode utilise des réseaux de neurones pour minimiser une fonction de perte qui approche 
la recherche de point fixe de |’équation de Bellman du planificateur social. 

L’un de nos résultats théoriques les plus importants est qu’il existe un équilibre sta- 
tionnaire, a la fois avec et sans chocs économétriques : il existe une masse de variables 
d’état d’agent telle que, aprés que les appariements optimaux ont été choisis par les agents, 
les memes masses de type d’agents sont présentes a la période suivante. L’existence d’un 
équilibre stationnaire ne dépend pas des paramétres du modeéle et permet éventuellement au 
chercheur d’ignorer la dynamique agrégée en imposant que le jeu d’appariement soit a un 
équilibre stationnaire. En nous concentrant sur |’équilibre stationnaire, nous introduisons en- 
core deux autres méthodes pour le calculer : l’une résout le systeme d’équations de |’équilibre 
stationnaire en utilisant un solveur de programmation non linéaire. La deuxieme méthode 
reformule le probleme de la recherche d’un équilibre stationnaire comme un probleme min- 
max et utilise l’algorithme primal-dual de Chambolle-Pock pour le résoudre. Nous montrons 
que ces deux méthodes peuvent s’adapter 4 des problemes avec de nombreux types d’agents. 
En plus de calculer un équilibre stationnaire, nous pouvons étendre les mémes estimateurs 
pour estimer structurellement les paramétres dans la production d’une correspondance avec 


un ensemble de données approprié. 
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